EXTRACELLULAR MATRIX AND ADHESION-ASSOCIATED PROTEINS

TECHNICAL FIELD 
This invention relates to nucleic acid and amino acid sequences of extracellular matrix and
adhesion-associated proteins and to the use of these sequences in the diagnosis, treatment, and
prevention of cell proliferative, immune, reproductive, neuronal, and genetic disorders.

BACKGROUND OF THE INVENTION 

Extracellular Matrix Proteins 

The extracellular matrix (ECM) is a complex network of glycoproteins, polysaccharides,
proteoglycans, and other macromolecules that are secreted from the cell into the extracellular space.
The ECM remains in close association with the cell surface and provides a supportive meshwork that
profoundly influences cell shape, motility, strength, flexibility, and adhesion. In fact, adhesion of a cell
to its surrounding matrix is required for cell survival except in the case of metastatic tumor cells, which
have overcome the need for cell-ECM anchorage. This phenomenon suggests that the ECM plays a
critical role in the molecular mechanisms of growth control and metastasis. (Reviewed in Ruoslahti, E.
(1996) Sci. Am. 275:72-77.) Furthermore, the ECM determines the structure and physical properties
of connective tissue and is particularly important for morphogenesis and other processes associated
with embryonic development and pattern formation.

Collagens

The collagens comprise a family of ECM proteins that provide structure to bone, teeth, skin,
ligaments, tendons, cartilage, blood vessels, and basement membranes. Multiple collagen proteins have
been identified. Three collagen molecules fold together in a triple helix stabilized by interchain disulfide
bonds. Bundles of these triple helices then associate to form fibrils. Collagen primary structure
consists of hundreds of (Gly-X-Y) repeats where about a third of the X and Y residues are Pro.
Glycines are crucial to helix formation as the bulkier amino acid side chains cannot fold into the triple
helical conformation. Because of these strict sequence requirements, mutations in collagen genes have
severe consequences. Osteogenesis imperfecta patients have brittle bones that fracture easily; in severe
cases patients die in utero or at birth. Ehlers-Danlos syndrome patients have hyperelastic skin,
hypermobile joints, and susceptibility to aortic and intestinal rupture. Chondrodysplasia patients have
short stature and ocular disorders. Alport syndrome patients have hematuria, sensorineural deafness,
and eye lens deformation. (See Isselbacher, K.J., et al. (1994) Harrison's Principles of Internal
Medicine, McGraw-Hill, Inc., New York, NY, pp. 2105-2117; and Creighton, T.E. (1984) Proteins,






WO 00/68380 PCT/US00/12811

Structures and Molecular Principles, W.H. Freeman and Company, New York, NY, pp. 191-197.)
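The (Gly-X-Y) repeat requirement described for collagens can be illustrated with a short sketch. The function name and the test peptides below are hypothetical and are not taken from the sequence listing; the sketch only counts consecutive triplets that begin with glycine and shows how a single glycine substitution interrupts the run.

```python
def longest_gly_xy_run(seq: str) -> int:
    """Return the length, in triplets, of the longest run of Gly-X-Y repeats.

    A repeat is any three residues whose first residue is glycine (G).
    Every start position is tried so that a run in any reading frame is found.
    """
    seq = seq.upper()
    best = 0
    for start in range(len(seq)):
        count = 0
        i = start
        # Extend the run while a full triplet remains and it begins with G.
        while i + 3 <= len(seq) and seq[i] == "G":
            count += 1
            i += 3
        best = max(best, count)
    return best


if __name__ == "__main__":
    # Hypothetical peptides: an intact repeat and a Gly-to-Ala substitution.
    print(longest_gly_xy_run("GPPGPAGPP"))
    print(longest_gly_xy_run("GPPAPAGPP"))
```

Replacing the second glycine with alanine in the illustrative peptide shortens the longest uninterrupted run, which is the kind of disruption the paragraph above associates with collagen mutations.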

Collectins are extracellular proteins with collagen tails and globular lectin domains that play an
important role in the first line immune response to microorganisms. The peripheral lectin domain
permits binding to sugar residues on microorganisms, while the collagen tail interacts with phagocyte
receptors or the complement system. Examples of collectins are the pulmonary surfactant proteins
SP-A and SP-D (Kuroki, S.D., et al. (1998) J. Biol. Chem. 273:4783-4789).

Elastin 

Elastin and related proteins confer elasticity to tissues such as skin, blood vessels, and lungs.
Elastin is a highly hydrophobic protein of about 750 amino acids that is rich in proline and glycine
residues. Elastin molecules are highly cross-linked, forming an extensive extracellular network of fibers
and sheets. Elastin fibers are surrounded by a sheath of microfibrils which are composed of a number
of glycoproteins, including fibrillin. Mutations in the gene encoding fibrillin are responsible for
Marfan's syndrome, a genetic disorder characterized by defects in connective tissue. In severe cases,
the aortas of afflicted individuals are prone to rupture. (Reviewed in Alberts, B., et al. (1994)
Molecular Biology of the Cell, Garland Publishing, New York, NY, pp. 984-986.)

Fibronectin 

Fibronectin is a large ECM glycoprotein found in all vertebrates. Fibronectin exists as a dimer
of two subunits, each containing about 2,500 amino acids. Each subunit folds into a rod-like structure
containing multiple domains. The domains each contain multiple repeated modules, the most common
of which is the type III fibronectin repeat. The type III fibronectin repeat is about 90 amino acids in
length and is also found in other ECM proteins and in some plasma membrane and cytoplasmic
proteins. Furthermore, some type III fibronectin repeats contain a characteristic tripeptide consisting of
Arginine-Glycine-Aspartic acid (RGD). The RGD sequence is recognized by the integrin family of cell
surface receptors and is also found in other ECM proteins. Disruption of both copies of the gene
encoding fibronectin causes early embryonic lethality in mice. The mutant embryos display extensive
morphological defects, including defects in the formation of the notochord, somites, heart, blood
vessels, neural tube, and extraembryonic structures. (Reviewed in Alberts, supra, pp. 986-987.)
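As an illustration of how the RGD tripeptide noted above might be located in a polypeptide sequence, the following minimal sketch scans a one-letter amino acid string for the motif. The function name and the example peptide are hypothetical, and the presence of RGD in a sequence does not by itself establish integrin binding.

```python
def find_rgd_sites(seq: str) -> list[int]:
    """Return the 1-based start positions of every RGD tripeptide in seq."""
    seq = seq.upper()
    # Slide a three-residue window along the sequence and record exact matches.
    return [i + 1 for i in range(len(seq) - 2) if seq[i:i + 3] == "RGD"]


if __name__ == "__main__":
    # Hypothetical peptide used only to exercise the function.
    print(find_rgd_sites("AVTGRGDSPASS"))
```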


Laminin 

Laminin is a major glycoprotein component of the basal lamina which underlies and supports
epithelial cell sheets. Laminin is one of the first ECM proteins synthesized in the developing embryo.
Laminin is an 850 kilodalton protein composed of three polypeptide chains joined in the shape of a
cross by disulfide bonds. Laminin is especially important for angiogenesis and, in particular, for
guiding the formation of capillaries. (Reviewed in Alberts, supra, pp. 990-991.)

Proteoglycans

There are many other types of proteinaceous ECM components, most of which can be classified
as proteoglycans. Proteoglycans are composed of unbranched polysaccharide chains
(glycosaminoglycans) attached to protein cores. Common proteoglycans include aggrecan, betaglycan,
decorin, perlecan, serglycin, and syndecan-1. Some of these molecules not only provide mechanical
support, but also bind to extracellular signaling molecules, such as fibroblast growth factor and
transforming growth factor β, suggesting a role for proteoglycans in cell-cell communication.
(Reviewed in Alberts, supra, pp. 973-978.) Likewise, the glycoproteins tenascin-C and tenascin-R are
expressed in developing and lesioned neural tissue and provide stimulatory and anti-adhesive
(inhibitory) properties, respectively, for axonal growth (Faissner, A. (1997) Cell Tissue Res.
290:331-341).

Dentin phosphoryn (DPP) is a major component of the dentin ECM. DPP is a proteoglycan
that is synthesized and expressed by odontoblasts (Gu, K., et al. (1998) Eur. J. Oral Sci.
106:1043-1047). DPP is believed to nucleate or modulate the formation of hydroxyapatite crystals. The gene
encoding DPP has been mapped to human chromosome 4. Chromosome 4 contains the gene loci for
two dentin genetic diseases, dentinogenesis imperfecta type II and dentin dysplasia type II (Feng, J.Q.,
et al. (1998) J. Biol. Chem. 273:9457-9464).

Mucins

Mucins are highly glycosylated glycoproteins that are the major structural component of the
mucus gel. The physiological functions of mucins are cytoprotection, mechanical protection,
maintenance of viscosity in secretions, and cellular recognition. MUC6 is a human gastric mucin that is
also found in gall bladder, pancreas, seminal vesicles, and female reproductive tract (Toribara, N.W., et
al. (1997) J. Biol. Chem. 272:16398-16403). The MUC6 gene has been mapped to human
chromosome 11 (Toribara, N.W., et al. (1993) J. Biol. Chem. 268:5879-5885). Hemomucin is a novel
Drosophila surface mucin that may be involved in the induction of antibacterial effector molecules
(Theopold, U., et al. (1996) J. Biol. Chem. 271:12708-12715).

Link Protein

Link protein binds to both cartilage proteoglycan and hyaluronan in cartilage ECM. This
binding stabilizes the aggregation of these cartilage ECM proteins and produces supramolecular
assemblies. Link protein has been detected in other connective tissues, where it may bind proteoglycans
and hyaluronan. Link protein contains a signal peptide, an immunoglobulin repeat, and link repeats
(Ayad, S., et al. (1994) The Extracellular Matrix FactsBook, Academic Press, Inc., San Diego, CA,
pp. 120-121).


Adhesion-Associated Proteins 

The surface of a cell is rich in transmembrane proteoglycans, glycoproteins, glycolipids, and
receptors. These macromolecules mediate adhesion with other cells and with components of the ECM.
The interaction of the cell with its surroundings profoundly influences cell shape, strength, flexibility,
motility, and adhesion. These dynamic properties are intimately associated with signal transduction
pathways controlling cell proliferation and differentiation, tissue construction, and embryonic
development.

Cadherins 

Cadherins comprise a family of calcium-dependent glycoproteins that function in mediating
cell-cell adhesion in virtually all solid tissues of multicellular organisms. These proteins share multiple
repeats of a cadherin-specific motif, and the repeats form the folding units of the cadherin ECM.
Cadherin molecules cooperate to form focal contacts, or adhesion plaques, between adjacent epithelial
cells. The cadherin family includes the classical cadherins and protocadherins. Classical cadherins
include the E-cadherin, N-cadherin, and P-cadherin subfamilies. E-cadherin is present on many types
of epithelial cells and is especially important for embryonic development. P-cadherin is present on cells
of the placenta and epidermis. Recent studies report that protocadherins are involved in a variety of
cell-cell interactions (Suzuki, S.T. (1996) J. Cell Sci. 109:2609-2611). The intracellular anchorage of
cadherins is regulated by their dynamic association with catenins, a family of cytoplasmic signal
transduction proteins associated with the actin cytoskeleton. The anchorage of cadherins to the actin
cytoskeleton appears to be regulated by protein tyrosine phosphorylation, and the cadherins are the
target of phosphorylation-induced junctional disassembly (Aberle, H., et al. (1996) J. Cell. Biochem.
61:514-523).

Integrins

Integrins are ubiquitous transmembrane adhesion molecules that link the ECM to the internal
cytoskeleton. Integrins are composed of two noncovalently associated transmembrane glycoprotein
subunits called α and β. Integrins function as receptors that play a role in signal transduction. For
example, binding of integrin to its extracellular ligand may stimulate changes in intracellular calcium
levels or protein kinase activity (Sjaastad, M.D. and Nelson, W.J. (1997) BioEssays 19:47-55). At
least ten cell surface receptors of the integrin family recognize the ECM component fibronectin, which
is involved in many different biological processes including cell migration and embryogenesis
(Johansson, S., et al. (1997) Front. Biosci. 2:D126-D146).


Lectins 

Lectins comprise a ubiquitous family of extracellular glycoproteins which bind cell surface
carbohydrates specifically and reversibly, resulting in the agglutination of cells. (Reviewed in
Drickamer, K. and Taylor, M.E. (1993) Annu. Rev. Cell Biol. 9:237-264.) This function is
particularly important for activation of the immune response. Lectins mediate the agglutination and
mitogenic stimulation of lymphocytes at sites of inflammation (Lasky, L.A. (1991) J. Cell. Biochem.
45:139-146; Paietta, E., et al. (1989) J. Immunol. 143:2850-2857).

Lectins are further classified into subfamilies based on carbohydrate-binding specificity and
other criteria. The galectin subfamily, in particular, includes lectins that bind β-galactoside
carbohydrate moieties in a thiol-dependent manner. (Reviewed in Hadari, Y.R., et al. (1995) J. Biol.
Chem. 270:3447-3453.) Galectins are widely expressed and developmentally regulated. Because all
galectins lack an N-terminal signal peptide, it is suggested that galectins are externalized through an
atypical secretory mechanism. Two classes of galectins have been defined based on molecular weight
and oligomerization properties. Small galectins form homodimers and are about 14-16 kilodaltons in
mass, while large galectins are monomeric and about 29-37 kilodaltons.

Galectins contain a characteristic carbohydrate recognition domain (CRD). The CRD is about
140 amino acids and contains several stretches of about 1-10 amino acids which are highly conserved
among all galectins. A particular 6-amino acid motif within the CRD contains conserved tryptophan
and arginine residues which are critical for carbohydrate binding. The CRD of some galectins also
contains cysteine residues which may be important for disulfide bond formation. Secondary structure
predictions indicate that the CRD forms several β-sheets.

Galectins play a number of roles in diseases and conditions associated with cell-cell and
cell-matrix interactions. For example, certain galectins associate with sites of inflammation and bind to cell
surface immunoglobulin E molecules. In addition, galectins may play an important role in cancer
metastasis. Galectin overexpression is correlated with the metastatic potential of cancers in humans
and mice. Moreover, anti-galectin antibodies inhibit processes associated with cell transformation, such
as cell aggregation and anchorage-independent growth. (See, for example, Su, Z.-Z., et al. (1996) Proc.
Natl. Acad. Sci. USA 93:7252-7257.)



Selectins

Selectins, or LEC-CAMs, comprise a specialized lectin subfamily involved primarily in
inflammation and leukocyte adhesion. (Reviewed in Lasky, supra.) Selectins, which mediate the
recruitment of leukocytes from the circulation to sites of acute inflammation, are expressed on the
surface of vascular endothelial cells in response to cytokine signaling. Selectins bind to specific ligands
on the leukocyte cell membrane and enable the leukocyte to adhere to and migrate along the endothelial
surface. Binding of selectin to its ligand leads to polarized rearrangement of the actin cytoskeleton and
stimulates signal transduction within the leukocyte (Brenner, B., et al. (1997) Biochem. Biophys. Res.
Commun. 231:802-807; Hidari, K.I., et al. (1997) J. Biol. Chem. 272:28750-28756). Members of the
selectin family possess three characteristic motifs: a lectin or carbohydrate recognition domain; an
epidermal growth factor (EGF)-like domain; and a variable number of short consensus repeats (scr or
"sushi" repeats) which are also present in complement regulatory proteins. The selectins include
lymphocyte adhesion molecule-1 (LAM-1 or L-selectin), endothelial leukocyte adhesion molecule-1
(ELAM-1 or E-selectin), and granule membrane protein-140 (GMP-140 or P-selectin) (Johnston, G.I.,
et al. (1989) Cell 56:1033-1044).

Attractin

Attractin is a 134 kilodalton glycoprotein found in the serum. It is a member of the CUB
family of cell adhesion proteins and binds directly to leukocytes. Attractin has a CUB domain, an EGF
domain, and C-type lectin protein domains. This serum protein mediates the interaction between T
lymphocytes and monocytes and leads to the adherence and spreading of monocytes that become the
foci for T cell clustering. (See Duke-Cohan, J.S., et al. (1998) Proc. Natl. Acad. Sci. USA
95:11336-11341.)



Proteins Containing Leucine Rich Repeats (LRRs)

LRRs are sequence motifs, approximately 22-28 amino acids in length, found in proteins with a
large variety of functions and cellular locations. Proteins containing LRRs are all thought to be
involved in protein-protein interactions. The crystal structure of LRRs has been studied and found to
correspond to beta-alpha structural units. These structural units form a parallel beta sheet with one
surface exposed to solvent. In this way an LRR-containing protein acquires a nonglobular shape
(Kobe, B. and Deisenhofer, J. (1994) Trends Biochem. Sci. 19:415-421). There is evidence to suggest
LRRs function in signal transduction and cellular adhesion as well as in protein-protein interactions
(Gay, N.J., et al. (1991) FEBS Lett. 29:87-91). For example, LRR proteins such as connectin and
chaoptin are important cell adhesion molecules in neuronal development in Drosophila melanogaster,
and mammalian homologs are found in mouse (Taguchi, et al. (1996) Brain Res. Mol. Brain Res.
1-2:31-40).

Proteins Containing Armadillo/β-Catenin-like Repeats

Various proteins such as those encoded by the Drosophila armadillo gene and the human APC
gene contain amino acid repeats that interact with β-catenins. The armadillo gene is required for
pattern formation within the embryonic segments and imaginal discs and is highly conserved. It is 63%
identical to a human protein, plakoglobin, which is involved in adhesive junctions joining epithelial and
other cells (Peifer, M. and Wieschaus, E. (1990) Cell 63:1167-1176). APC gene mutations appear to
initiate inherited forms of human colorectal cancer and sporadic forms of colorectal and gastric cancer
(Rubinfeld, B., et al. (1993) Science 262:1731-1734). The fact that the protein encoded by APC
interacts with catenin suggests a link between tumor initiation and cell adhesion (Su, L.K., et al. (1993)
Science 262:1734-1737).

Proteins Containing C-type Lectin Domains

C-type lectin domains are found in a variety of proteins, including selectins and lecticans.
Lecticans are a family of chondroitin sulfate proteoglycans that include aggrecan, versican, neurocan,
and brevican. All C-type lectin proteins are involved in protein-protein interactions (Aspberg, A., et al.
(1997) Proc. Natl. Acad. Sci. USA 94:10116-10121). A novel macrophage-restricted C-type lectin
protein has been cloned from mouse tissue. It is a type II transmembrane protein with one extracellular
C-type lectin domain (Balch, S.G., et al. (1998) J. Biol. Chem. 273:18656-18664).

Bystin

Bystin is a cytoplasmic protein that binds directly to trophinin, a cell adhesion molecule, and
tastin. The three molecules form a complex that is involved in cell adhesion. Bystin, tastin, and
trophinin are strongly expressed in cells involved in the implantation of embryos, specifically in cells at
human implantation sites and in intermediate trophoblasts at the invasion front of the placenta in early
pregnancy. Bystin also binds to cytokeratins. During early embryogenesis cytokeratins 8 and 18 are
expressed in the trophectoderm of blastocysts. It is possible that the molecular complex formed by
bystin, tastin, and trophinin interacts with the cytokeratins of trophectoderm cells at the time of
implantation. A key component of embryo implantation is the unique cell adhesion to endometrial
epithelium that occurs and the subsequent invasion of the maternal tissue by the trophoblast. Bystin
may have an important role in the signal transduction that links cell adhesion to proliferation (Suzuki,
N., et al. (1998) Proc. Natl. Acad. Sci. USA 95:5027-5032).

Src-homology 3 (SH3) Domain-Containing Proteins

SH3 is a 60-70 amino acid motif found in a variety of signal transduction and cytoskeletal
proteins. The SH3 domain is involved in mediating protein-protein interactions. Evidence suggests that
the SH3 domains recognize a family of related domains or proteins in a variety of different tissues and
species. One novel SH3 domain-containing protein is the 52 kilodalton focal adhesion protein (FAP52
or p52). FAP52 is localized to focal adhesions, specialized membrane domains in cultured cells that
mediate the attachment of cells to the growth substratum and ECM. Focal adhesions consist of
structural proteins, integrins, regulatory molecules, and signaling molecules and are involved in cell
signaling. FAP52 may form part of this multimolecular complex that comprises focal adhesion sites
(Merilainen, J., et al. (1997) J. Biol. Chem. 272:23278-23284).

The ECM plays an important role in cell invasive processes such as angiogenesis and tumor
metastasis (Ruoslahti, supra). In particular, the glycoproteins laminin and fibronectin are implicated in
the migration of tumor cells through the ECM (chemotaxis) in the course of metastasis of tumors to
other tissues. The same process, chemotaxis, also promotes the migration of vascular endothelial cells
to form new microvascular networks to support these tumors (tumor angiogenesis).

The discovery of new extracellular matrix and adhesion-associated proteins and the
polynucleotides encoding them satisfies a need in the art by providing new compositions which are
useful in the diagnosis, prevention, and treatment of cell proliferative, immune, reproductive, neuronal,
and genetic disorders.


SUMMARY OF THE INVENTION 

The invention features purified polypeptides, extracellular matrix and adhesion-associated
proteins, referred to collectively as "EXMAD" and individually as "EXMAD-1," "EXMAD-2,"
"EXMAD-3," "EXMAD-4," "EXMAD-5," "EXMAD-6," "EXMAD-7," "EXMAD-8," "EXMAD-9,"
"EXMAD-10," "EXMAD-11," "EXMAD-12," "EXMAD-13," "EXMAD-14," "EXMAD-15,"
"EXMAD-16," "EXMAD-17," "EXMAD-18," "EXMAD-19," "EXMAD-20," "EXMAD-21,"
"EXMAD-22," "EXMAD-23," "EXMAD-24," and "EXMAD-25." In one aspect, the invention
provides an isolated polypeptide comprising a) an amino acid sequence selected from the group
consisting of SEQ ID NO:1-25, b) a naturally occurring amino acid sequence having at least 90%
sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:1-25, c)
a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID
NO:1-25, or d) an immunogenic fragment of an amino acid sequence selected from the group consisting
of SEQ ID NO:1-25. In one alternative, the invention provides an isolated polypeptide comprising the
amino acid sequence of SEQ ID NO:1-25.
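The "at least 90% sequence identity" criterion recited above can be pictured with a minimal sketch. The sketch assumes that the two sequences have already been aligned to equal length with "-" marking gaps, and it counts identical residues over all aligned columns. That convention, the function names, and the example peptides are assumptions made here for illustration only; the specification's own definitions and the tools listed in Table 5 govern how identity is actually determined.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns holding identical residues.

    Both inputs must be pre-aligned to the same length, with '-' for gaps.
    Columns that are gaps in both sequences are ignored; a gap opposite a
    residue counts as a mismatch (one common convention among several).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = 0
    compared = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" and b == "-":
            continue
        compared += 1
        if a == b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 90.0) -> bool:
    """True when the pair reaches the stated identity threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Hypothetical ten-residue peptides differing at one position.
    print(percent_identity("MKTAYIAKQR", "MKTAYIAKQL"))
```

Other conventions, for example dividing by the length of the shorter sequence, give different percentages for the same pair, which is why the governing definition matters.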

The invention further provides an isolated polynucleotide encoding a polypeptide comprising a)
an amino acid sequence selected from the group consisting of SEQ ID NO:1-25, b) a naturally
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence
selected from the group consisting of SEQ ID NO:1-25, c) a biologically active fragment of an amino
acid sequence selected from the group consisting of SEQ ID NO:1-25, or d) an immunogenic fragment
of an amino acid sequence selected from the group consisting of SEQ ID NO:1-25. In one alternative,
the polynucleotide is selected from the group consisting of SEQ ID NO:26-50.

Additionally, the invention provides a recombinant polynucleotide comprising a promoter
sequence operably linked to a polynucleotide encoding a polypeptide comprising a) an amino acid
sequence selected from the group consisting of SEQ ID NO:1-25, b) a naturally occurring amino acid
sequence having at least 90% sequence identity to an amino acid sequence selected from the group
consisting of SEQ ID NO:1-25, c) a biologically active fragment of an amino acid sequence selected
from the group consisting of SEQ ID NO:1-25, or d) an immunogenic fragment of an amino acid
sequence selected from the group consisting of SEQ ID NO:1-25. In one alternative, the invention
provides a cell transformed with the recombinant polynucleotide. In another alternative, the invention
provides a transgenic organism comprising the recombinant polynucleotide.

The invention also provides a method for producing a polypeptide comprising a) an amino acid
sequence selected from the group consisting of SEQ ID NO:1-25, b) a naturally occurring amino acid
sequence having at least 90% sequence identity to an amino acid sequence selected from the group
consisting of SEQ ID NO:1-25, c) a biologically active fragment of an amino acid sequence selected
from the group consisting of SEQ ID NO:1-25, or d) an immunogenic fragment of an amino acid
sequence selected from the group consisting of SEQ ID NO:1-25. The method comprises a) culturing a
cell under conditions suitable for expression of the polypeptide, wherein said cell is transformed with a
recombinant polynucleotide comprising a promoter sequence operably linked to a polynucleotide
encoding the polypeptide, and b) recovering the polypeptide so expressed.

Additionally, the invention provides an isolated antibody which specifically binds to a
polypeptide comprising a) an amino acid sequence selected from the group consisting of SEQ ID
NO:1-25, b) a naturally occurring amino acid sequence having at least 90% sequence identity to an amino
acid sequence selected from the group consisting of SEQ ID NO:1-25, c) a biologically active fragment
of an amino acid sequence selected from the group consisting of SEQ ID NO:1-25, or d) an
immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID
NO:1-25.

The invention further provides an isolated polynucleotide comprising a) a polynucleotide
sequence selected from the group consisting of SEQ ID NO:26-50, b) a naturally occurring
polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence selected
from the group consisting of SEQ ID NO:26-50, c) a polynucleotide sequence complementary to a), or
d) a polynucleotide sequence complementary to b). In one alternative, the polynucleotide comprises at
least 60 contiguous nucleotides.

Additionally, the invention provides a method for detecting a target polynucleotide in a sample,
said target polynucleotide having a sequence of a polynucleotide comprising a) a polynucleotide
sequence selected from the group consisting of SEQ ID NO:26-50, b) a naturally occurring
polynucleotide sequence having at least 90% sequence identity to a polynucleotide sequence selected
from the group consisting of SEQ ID NO:26-50, c) a polynucleotide sequence complementary to a), or
d) a polynucleotide sequence complementary to b). The method comprises a) hybridizing the sample
with a probe comprising at least 16 contiguous nucleotides comprising a sequence complementary to
said target polynucleotide in the sample, and which probe specifically hybridizes to said target
polynucleotide, under conditions whereby a hybridization complex is formed between said probe and
said target polynucleotide, and b) detecting the presence or absence of said hybridization complex, and
optionally, if present, the amount thereof. In one alternative, the probe comprises at least 30 contiguous
nucleotides. In another alternative, the probe comprises at least 60 contiguous nucleotides.
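The probe requirement recited above, at least 16 contiguous nucleotides complementary to the target, can be sketched computationally. The following illustration treats complementarity as exact Watson-Crick pairing of DNA bases and ignores hybridization conditions, which in practice determine whether a complex forms. The function names and the example target are hypothetical and are not drawn from SEQ ID NO:26-50.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def probe_can_pair(probe: str, target: str, min_len: int = 16) -> bool:
    """True if the probe has at least min_len contiguous nucleotides that are
    exactly complementary to some stretch of the target strand."""
    target = target.upper()
    # A probe region pairs with the target where its reverse complement
    # appears verbatim in the target sequence.
    rc = reverse_complement(probe)
    for i in range(len(rc) - min_len + 1):
        if rc[i:i + min_len] in target:
            return True
    return False


if __name__ == "__main__":
    # Hypothetical target; the probe is built from a 20-nucleotide window.
    target = "ATGGCCATTGTAATGGGCCGCTGAAAGGGTGCCCGATAG"
    probe = reverse_complement(target[5:25])
    print(probe_can_pair(probe, target))
    print(probe_can_pair(probe, target, min_len=30))
```

Raising `min_len` to 30 or 60 mirrors the alternatives in which the probe comprises at least 30 or at least 60 contiguous nucleotides.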

The invention further provides a pharmaceutical composition comprising an effective amount
of a polypeptide comprising a) an amino acid sequence selected from the group consisting of SEQ ID
NO:1-25, b) a naturally occurring amino acid sequence having at least 90% sequence identity to an
amino acid sequence selected from the group consisting of SEQ ID NO:1-25, c) a biologically active
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:1-25, or d) an
immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID
NO:1-25, and a pharmaceutically acceptable excipient. The invention additionally provides a method of
treating a disease or condition associated with decreased expression of functional EXMAD, comprising
administering to a patient in need of such treatment the pharmaceutical composition.

The invention also provides a method for screening a compound for effectiveness as an
agonist of a polypeptide comprising a) an amino acid sequence selected from the group consisting of
SEQ ID NO:1-25, b) a naturally occurring amino acid sequence having at least 90% sequence identity
to an amino acid sequence selected from the group consisting of SEQ ID NO:1-25, c) a biologically
active fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:1-25, or
d) an immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID
NO:1-25. The method comprises a) exposing a sample comprising the polypeptide to a compound,
and b) detecting agonist activity in the sample. In one alternative, the invention provides a
pharmaceutical composition comprising an agonist compound identified by the method and a
pharmaceutically acceptable excipient. In another alternative, the invention provides a method of
treating a disease or condition associated with decreased expression of functional EXMAD,
comprising administering to a patient in need of such treatment the pharmaceutical composition.

Additionally, the invention provides a method for screening a compound for effectiveness as
an antagonist of a polypeptide comprising a) an amino acid sequence selected from the group consisting
of SEQ ID NO:1-25, b) a naturally occurring amino acid sequence having at least 90% sequence
identity to an amino acid sequence selected from the group consisting of SEQ ID NO:1-25, c) a
biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID
NO:1-25, or d) an immunogenic fragment of an amino acid sequence selected from the group consisting
of SEQ ID NO:1-25. The method comprises a) exposing a sample comprising the polypeptide to a
compound, and b) detecting antagonist activity in the sample. In one alternative, the invention
provides a pharmaceutical composition comprising an antagonist compound identified by the method
and a pharmaceutically acceptable excipient. In another alternative, the invention provides a method
of treating a disease or condition associated with overexpression of functional EXMAD, comprising
administering to a patient in need of such treatment the pharmaceutical composition.

The invention further provides a method for screening a compound for effectiveness in
altering expression of a target polynucleotide, wherein said target polynucleotide comprises a
sequence selected from the group consisting of SEQ ID NO:26-50, the method comprising a)
exposing a sample comprising the target polynucleotide to a compound, and b) detecting altered
expression of the target polynucleotide.

BRIEF DESCRIPTION OF THE TABLES 

Table 1 shows polypeptide and nucleotide sequence identification numbers (SEQ ID NOs),
clone identification numbers (clone IDs), cDNA libraries, and cDNA fragments used to assemble
full-length sequences encoding EXMAD.

Table 2 shows features of each polypeptide sequence, including potential motifs, homologous 
sequences, and methods, algorithms, and searchable databases used for analysis of EXMAD. 

Table 3 shows selected fragments of each nucleic acid sequence; the tissue-specific expression
patterns of each nucleic acid sequence as determined by northern analysis; diseases, disorders, or
conditions associated with these tissues; and the vector into which each cDNA was cloned.

Table 4 describes the tissues used to construct the cDNA libraries from which cDNA clones
encoding EXMAD were isolated.

Table 5 shows the tools, programs, and algorithms used to analyze EXMAD, along with 
applicable descriptions, references, and threshold parameters. 

DESCRIPTION OF THE INVENTION

Before the present proteins, nucleotide sequences, and methods are described, it is understood 
that this invention is not limited to the particular machines, materials and methods described, as these 
may vary. It is also to be understood that the terminology used herein is for the purpose of describing
particular embodiments only, and is not intended to limit the scope of the present invention which will
be limited only by the appended claims. 

It must be noted that as used herein and in the appended claims, the singular forms "a," "an,"
and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, a
reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a
reference to one or more antibodies and equivalents thereof known to those skilled in the art, and so
forth.

Unless defined otherwise, all technical and scientific terms used herein have the same meanings 
as commonly understood by one of ordinary skill in the art to which this invention belongs. Although 
any machines, materials, and methods similar or equivalent to those described herein can be used to
practice or test the present invention, the preferred machines, materials and methods are now described.
All publications mentioned herein are cited for the purpose of describing and disclosing the cell lines, 
protocols, reagents and vectors which are reported in the publications and which might be used in 
connection with the invention. Nothing herein is to be construed as an admission that the invention is 
not entitled to antedate such disclosure by virtue of prior invention. 

DEFINITIONS

"EXMAD" refers to the amino acid sequences of substantially purified EXMAD obtained from
any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and 
human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant. 

The term "agonist" refers to a molecule which intensifies or mimics the biological activity of 
EXMAD. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other
compound or composition which modulates the activity of EXMAD either by directly interacting with 
EXMAD or by acting on components of the biological pathway in which EXMAD participates. 

An "allelic variant" is an alternative form of the gene encoding EXMAD. Allelic variants may
result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in
polypeptides whose structure or function may or may not be altered. A gene may have none, one, or
many allelic variants of its naturally occurring form. Common mutational changes which give rise to
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides.
Each of these types of changes may occur alone, or in combination with the others, one or more times in
a given sequence.

"Altered" nucleic acid sequences encoding EXMAD include those sequences with deletions,
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as EXMAD or a
polypeptide with at least one functional characteristic of EXMAD. Included within this definition are
polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe of
the polynucleotide encoding EXMAD, and improper or unexpected hybridization to allelic variants,
with a locus other than the normal chromosomal locus for the polynucleotide sequence encoding
EXMAD. The encoded protein may also be "altered," and may contain deletions, insertions, or
substitutions of amino acid residues which produce a silent change and result in a functionally
equivalent EXMAD. Deliberate amino acid substitutions may be made on the basis of similarity in
polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the
residues, as long as the biological or immunological activity of EXMAD is retained. For example,
negatively charged amino acids may include aspartic acid and glutamic acid, and positively charged
amino acids may include lysine and arginine. Amino acids with uncharged polar side chains having
similar hydrophilicity values may include: asparagine and glutamine; and serine and threonine.
Amino acids with uncharged side chains having similar hydrophilicity values may include: leucine,
isoleucine, and valine; glycine and alanine; and phenylalanine and tyrosine.

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide, 
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic 
molecules. Where "amino acid sequence" is recited to refer to an amino acid sequence of a naturally
occurring protein molecule, "amino acid sequence" and like terms are not meant to limit the amino acid
sequence to the complete native amino acid sequence associated with the recited protein molecule. 

"Amplification" relates to the production of additional copies of a nucleic acid sequence.
Amplification is generally carried out using polymerase chain reaction (PCR) technologies well known
in the art. 

The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity of
EXMAD. Antagonists may include proteins such as antibodies, nucleic acids, carbohydrates, small
molecules, or any other compound or composition which modulates the activity of EXMAD either by
directly interacting with EXMAD or by acting on components of the biological pathway in which
EXMAD participates.

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments thereof,
such as Fab, F(ab')2, and Fv fragments, which are capable of binding an epitopic determinant.
Antibodies that bind EXMAD polypeptides can be prepared using intact polypeptides or using
fragments containing small peptides of interest as the immunizing antigen. The polypeptide or
oligopeptide used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the
translation of RNA, or synthesized chemically, and can be conjugated to a carrier protein if desired.
Commonly used carriers that are chemically coupled to peptides include bovine serum albumin,
thyroglobulin, and keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize
the animal.

The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that
makes contact with a particular antibody. When a protein or a fragment of a protein is used to
immunize a host animal, numerous regions of the protein may induce the production of antibodies which
bind specifically to antigenic determinants (particular regions or three-dimensional structures on the
protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen used to
elicit the immune response) for binding to an antibody.

The term "antisense" refers to any composition capable of base-pairing with the "sense" strand
of a specific nucleic acid sequence. Antisense compositions may include DNA; RNA; peptide nucleic
acid (PNA); oligonucleotides having modified backbone linkages such as phosphorothioates,
methylphosphonates, or benzylphosphonates; oligonucleotides having modified sugar groups such as
2'-methoxyethyl sugars or 2'-methoxyethoxy sugars; or oligonucleotides having modified bases such as
5-methyl cytosine, 2'-deoxyuracil, or 7-deaza-2'-deoxyguanosine. Antisense molecules may be produced
by any method including chemical synthesis or transcription. Once introduced into a cell, the
complementary antisense molecule base-pairs with a naturally occurring nucleic acid sequence
produced by the cell to form duplexes which block either transcription or translation. The designation
"negative" or "minus" can refer to the antisense strand, and the designation "positive" or "plus" can
refer to the sense strand of a reference DNA molecule.

The term "biologically active" refers to a protein having structural, regulatory, or biochemical 
functions of a naturally occurring molecule. Likewise, "immunologically active" refers to the capability 
of the natural, recombinant, or synthetic EXMAD, or of any oligopeptide thereof, to induce a specific
immune response in appropriate animals or cells and to bind with specific antibodies. 

The terms "complementary" and "complementarity" refer to the natural binding of
polynucleotides by base pairing. For example, the sequence "5' A-G-T 3'" bonds to the complementary
sequence "3' T-C-A 5'." Complementarity between two single-stranded molecules may be "partial,"
such that only some of the nucleic acids bind, or it may be "complete," such that total complementarity
exists between the single stranded molecules. The degree of complementarity between nucleic acid
strands has significant effects on the efficiency and strength of the hybridization between the nucleic
acid strands. This is of particular importance in amplification reactions, which depend upon binding
between nucleic acid strands, and in the design and use of peptide nucleic acid (PNA) molecules.
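
The A-G-T / T-C-A example and the distinction between "partial" and "complete" complementarity can be illustrated with a minimal Python sketch. The function names and the choice to score complementarity as a simple fraction of paired positions are illustrative assumptions, not part of the specification.

```python
# Watson-Crick pairing for DNA bases (A-T, G-C).
PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}


def complement_3_to_5(strand_5_to_3: str) -> str:
    """Return the complementary strand, written 3'->5', for a 5'->3' DNA strand."""
    return "".join(PAIRS[base] for base in strand_5_to_3.upper())


def fraction_paired(strand_5_to_3: str, other_3_to_5: str) -> float:
    """Fraction of aligned positions that form a Watson-Crick pair.

    1.0 corresponds to "complete" complementarity; a value between 0 and 1
    corresponds to "partial" complementarity. Strands must be equal in length.
    """
    if len(strand_5_to_3) != len(other_3_to_5):
        raise ValueError("strands must have the same length")
    if not strand_5_to_3:
        raise ValueError("strands must not be empty")
    paired = sum(
        PAIRS.get(a) == b
        for a, b in zip(strand_5_to_3.upper(), other_3_to_5.upper())
    )
    return paired / len(strand_5_to_3)


print(complement_3_to_5("AGT"))       # TCA
print(fraction_paired("AGT", "TCA"))  # 1.0 (complete)
```

This scoring ignores hybridization conditions entirely; it only counts which positions could pair.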

A "composition comprising a given polynucleotide sequence" and a "composition comprising a 
given amino acid sequence" refer broadly to any composition containing the given polynucleotide or
amino acid sequence. The composition may comprise a dry formulation or an aqueous solution.
Compositions comprising polynucleotide sequences encoding EXMAD or fragments of EXMAD may
be employed as hybridization probes. The probes may be stored in freeze-dried form and may be
associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be
deployed in an aqueous solution containing salts (e.g., NaCl), detergents (e.g., sodium dodecyl sulfate;
SDS), and other components (e.g., Denhardt's solution, dry milk, salmon sperm DNA, etc.).

"Consensus sequence" refers to a nucleic acid sequence which has been resequenced to resolve 
uncalled bases, extended using the XL-PCR kit (Perkin-Elmer, Norwalk CT) in the 5' and/or the 3' 
direction, and resequenced, or which has been assembled from the overlapping sequences of one or 
more Incyte Clones and, in some cases, one or more public domain ESTs, using a computer program
for fragment assembly, such as the GELVIEW fragment assembly system (GCG, Madison WI). Some
sequences have been both extended and assembled to produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions that, when made, least interfere 
with the properties of the original protein, i.e., the structure and especially the function of the protein is
conserved and not significantly changed by such substitutions. The table below shows amino acids
which may be substituted for an original amino acid in a protein and which are regarded as conservative
amino acid substitutions.

Original Residue      Conservative Substitution

Ala                   Gly, Ser
Arg                   His, Lys
Asn                   Asp, Gln, His
Asp                   Asn, Glu
Cys                   Ala, Ser
Gln                   Asn, Glu, His
Glu                   Asp, Gln, His
Gly                   Ala
His                   Asn, Arg, Gln, Glu
Ile                   Leu, Val
Leu                   Ile, Val
Lys                   Arg, Gln, Glu
Met                   Leu, Ile
Phe                   His, Met, Leu, Trp, Tyr
Ser                   Cys, Thr
Thr                   Ser, Val
Trp                   Phe, Tyr
Tyr                   His, Phe, Trp
Val                   Ile, Leu, Thr



Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide
backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation,
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of the
side chain.
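
The table of conservative substitutions above can be encoded as a lookup structure. The following Python sketch is illustrative only: the dictionary transcribes the table using three-letter residue codes, and the helper name `is_conservative` is a hypothetical choice rather than anything defined in the specification.

```python
# Conservative substitutions, keyed by the original residue (three-letter codes).
CONSERVATIVE = {
    "Ala": {"Gly", "Ser"},
    "Arg": {"His", "Lys"},
    "Asn": {"Asp", "Gln", "His"},
    "Asp": {"Asn", "Glu"},
    "Cys": {"Ala", "Ser"},
    "Gln": {"Asn", "Glu", "His"},
    "Glu": {"Asp", "Gln", "His"},
    "Gly": {"Ala"},
    "His": {"Asn", "Arg", "Gln", "Glu"},
    "Ile": {"Leu", "Val"},
    "Leu": {"Ile", "Val"},
    "Lys": {"Arg", "Gln", "Glu"},
    "Met": {"Leu", "Ile"},
    "Phe": {"His", "Met", "Leu", "Trp", "Tyr"},
    "Ser": {"Cys", "Thr"},
    "Thr": {"Ser", "Val"},
    "Trp": {"Phe", "Tyr"},
    "Tyr": {"His", "Phe", "Trp"},
    "Val": {"Ile", "Leu", "Thr"},
}


def is_conservative(original: str, replacement: str) -> bool:
    """True if the table lists `replacement` as conservative for `original`."""
    return replacement in CONSERVATIVE.get(original, set())


print(is_conservative("Cys", "Ala"))  # True
print(is_conservative("Ala", "Cys"))  # False: the table is read row by row
```

As the second call shows, a lookup of this kind treats the table as directional: Ala is listed for Cys, but Cys is not listed for Ala.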

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the
absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to the chemical modification of a polypeptide sequence, or a
polynucleotide sequence. Chemical modifications of a polynucleotide sequence can include, for
example, replacement of hydrogen by an alkyl, acyl, hydroxyl, or amino group. A derivative
polynucleotide encodes a polypeptide which retains at least one biological or immunological function of
the natural molecule. A derivative polypeptide is one modified by glycosylation, pegylation, or any
similar process that retains at least one biological or immunological function of the polypeptide from
which it was derived.

A "fragment" is a unique portion of EXMAD or the polynucleotide encoding EXMAD which
is identical in sequence to but shorter in length than the parent sequence. A fragment may comprise
up to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For
example, a fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A
fragment used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least
5, 10, 15, 16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or
amino acid residues in length. Fragments may be preferentially selected from certain regions of a
molecule. For example, a polypeptide fragment may comprise a certain length of contiguous amino
acids selected from the first 250 or 500 amino acids (or first 25% or 50% of a polypeptide) as shown
in a certain defined sequence. Clearly these lengths are exemplary, and any length that is supported
by the specification, including the Sequence Listing, tables, and figures, may be encompassed by the
present embodiments.

A fragment of SEQ ID NO:26-50 comprises a region of unique polynucleotide sequence that
specifically identifies SEQ ID NO:26-50, for example, as distinct from any other sequence in the
same genome. A fragment of SEQ ID NO:26-50 is useful, for example, in hybridization and
amplification technologies and in analogous methods that distinguish SEQ ID NO:26-50 from related
polynucleotide sequences. The precise length of a fragment of SEQ ID NO:26-50 and the region of
SEQ ID NO:26-50 to which the fragment corresponds are routinely determinable by one of ordinary
skill in the art based on the intended purpose for the fragment.
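
As a rough illustration of what it means for a fragment to be a "region of unique polynucleotide sequence," the Python sketch below lists the contiguous fragments of a given length that occur in a target sequence but in none of a set of comparison sequences. It is a simplification under stated assumptions: exact string matching only, no reverse complements, no mismatches, and toy sequences that are not taken from the Sequence Listing. The function name is hypothetical.

```python
from typing import List


def unique_fragments(target: str, others: List[str], length: int) -> List[str]:
    """Contiguous fragments of `target` (given length) absent from every other sequence."""
    if length < 1:
        raise ValueError("length must be positive")
    target = target.upper()
    background = [seq.upper() for seq in others]
    found: List[str] = []
    for start in range(len(target) - length + 1):
        fragment = target[start:start + length]
        # Keep each fragment once, and only if no background sequence contains it.
        if fragment not in found and not any(fragment in seq for seq in background):
            found.append(fragment)
    return found


# Toy example: the two sequences differ at a single position.
print(unique_fragments("ACGTTGCA", ["ACGTAGCA"], 3))  # ['GTT', 'TTG', 'TGC']
```

In practice the comparison set would be far larger (for example, a genome), and specificity would also depend on hybridization conditions, which this sketch does not model.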

A fragment of SEQ ID NO:1-25 is encoded by a fragment of SEQ ID NO:26-50. A fragment
of SEQ ID NO:1-25 comprises a region of unique amino acid sequence that specifically identifies
SEQ ID NO:1-25. For example, a fragment of SEQ ID NO:1-25 is useful as an immunogenic peptide
for the development of antibodies that specifically recognize SEQ ID NO:1-25. The precise length of
a fragment of SEQ ID NO:1-25 and the region of SEQ ID NO:1-25 to which the fragment
corresponds are routinely determinable by one of ordinary skill in the art based on the intended
purpose for the fragment.

The term "similarity" refers to a degree of complementarity. There may be partial similarity or
complete similarity. The word "identity" may substitute for the word "similarity." A partially
complementary sequence that at least partially inhibits an identical sequence from hybridizing to a
target nucleic acid is referred to as "substantially similar." The inhibition of hybridization of the
completely complementary sequence to the target sequence may be examined using a hybridization
assay (Southern or northern blot, solution hybridization, and the like) under conditions of reduced
stringency. A substantially similar sequence or hybridization probe will compete for and inhibit the
binding of a completely similar (identical) sequence to the target sequence under conditions of reduced
stringency. This is not to say that conditions of reduced stringency are such that non-specific binding is
permitted, as reduced stringency conditions require that the binding of two sequences to one another be
a specific (i.e., a selective) interaction. The absence of non-specific binding may be tested by the use of
a second target sequence which lacks even a partial degree of complementarity (e.g., less than about
30% similarity or identity). In the absence of non-specific binding, the substantially similar sequence or
probe will not hybridize to the second non-complementary target sequence.

The phrases "percent identity" and "% identity," as applied to polynucleotide sequences, refer 
to the percentage of residue matches between at least two polynucleotide sequences aligned using a 
standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps in 
the sequences being compared in order to optimize alignment between two sequences, and therefore 
achieve a more meaningful comparison of the two sequences.
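
To make "percentage of residue matches" concrete, the Python sketch below computes percent identity for two sequences that have already been aligned, with gaps written as "-". It is an illustration under assumptions: the denominator used here (all alignment columns other than gap-versus-gap columns) is one possible convention, and alignment programs such as CLUSTAL V or BLAST may define the denominator differently. The function name is hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of compared alignment columns in which both residues match.

    Both inputs must come from the same alignment, so they have equal length.
    Columns where both sequences have a gap are skipped; a gap opposite a
    residue counts as a compared, non-matching column.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    matches = 0
    compared = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap and b == gap:
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no columns to compare")
    return 100.0 * matches / compared


# Six columns, four matching residues: 4 / 6 of the columns are identical.
print(round(percent_identity("ACGT-A", "ACCTGA"), 1))  # 66.7
```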

Percent identity between polynucleotide sequences may be determined using the default
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e sequence
alignment program. This program is part of the LASERGENE software package, a suite of molecular
biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in Higgins, D.G.
and P.M. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) CABIOS 8:189-191.
For pairwise alignments of polynucleotide sequences, the default parameters are set as follows:
Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" residue weight table is
selected as the default. Percent identity is reported by CLUSTAL V as the "percent similarity" between
aligned polynucleotide sequence pairs.
Alternatively, a suite of commonly used and freely available sequence comparison algorithms is
provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment Search
Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), which is available from several
sources, including the NCBI, Bethesda, MD, and on the Internet at
http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence analysis
programs including "blastn," that is used to align a known polynucleotide sequence with other
polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2
Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf/bl2.html. The
"BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST
programs are commonly used with gap and other parameters set to default settings. For example, to
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version
2.0.9 (May-07-1999) set at default parameters. Such default parameters may be, for example:

Matrix: BLOSUM62
Reward for match: 1
Penalty for mismatch: -2
Open Gap: 5 and Extension Gap: 2 penalties
Gap x drop-off: 50
Expect: 10
Word Size: 11
Filter: on

Percent identity may be measured over the length of an entire defined sequence, for example, as 
defined by a particular SEQ ID number, or may be measured over a shorter length, for example, over 
the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at least 20, at
least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous nucleotides. Such
lengths are exemplary only, and it is understood that any fragment length supported by the sequences 
shown herein, in the tables, figures, or Sequence Listing, may be used to describe a length over which 
percentage identity may be measured. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes in
a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid sequences 
that all encode substantially the same protein. 
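
The effect of the degeneracy of the genetic code can be shown with a short Python sketch that translates two different nucleotide sequences with the standard genetic code. The two nine-base sequences are invented for illustration and are not taken from the Sequence Listing; the helper names are hypothetical.

```python
from itertools import product

# Standard genetic code, built in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding DNA sequence (length divisible by 3) into one-letter residues."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


seq_a = "TTAAGCCGT"  # codons TTA, AGC, CGT
seq_b = "CTGTCAAGA"  # codons CTG, TCA, AGA

identical_positions = sum(a == b for a, b in zip(seq_a, seq_b))
print(translate(seq_a), translate(seq_b))  # LSR LSR
print(identical_positions, "of", len(seq_a), "nucleotide positions identical")  # 2 of 9
```

Here the two sequences share only two of nine nucleotide positions yet encode the same Leu-Ser-Arg tripeptide, because Leu, Ser, and Arg are each specified by six codons.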

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to 
the percentage of residue matches between at least two polypeptide sequences aligned using a 
standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some alignment 
methods take into account conservative amino acid substitutions. Such conservative substitutions, 
explained in more detail above, generally preserve the hydrophobicity and acidity at the site of 
substitution, thus preserving the structure (and therefore function) of the polypeptide. 

Percent identity between polypeptide sequences may be determined using the default parameters
of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e sequence alignment
program (described and referenced above). For pairwise alignments of polypeptide sequences using
CLUSTAL V, the default parameters are set as follows: Ktuple=1, gap penalty=3, window=5, and
"diagonals saved"=5. The PAM250 matrix is selected as the default residue weight table. As with
polynucleotide alignments, the percent identity is reported by CLUSTAL V as the "percent similarity"
between aligned polypeptide sequence pairs.

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise
comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 2.0.9
(May-07-1999) with blastp set at default parameters. Such default parameters may be, for example:

Matrix: BLOSUM62
Open Gap: 11 and Extension Gap: 1 penalties
Gap x drop-off: 50
Expect: 10
Word Size: 3
Filter: on

Percent identity may be measured over the length of an entire defined polypeptide sequence, for 
example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for instance, 
a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 150 
contiguous residues. Such lengths are exemplary only, and it is understood that any fragment length
supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be used to 
describe a length over which percentage identity may be measured. 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size, and which contain all of the elements required for 
stable mitotic chromosome segregation and maintenance.

The term "humanized antibody" refers to antibody molecules in which the amino acid sequence 
in the non-antigen binding regions has been altered so that the antibody more closely resembles a human 
antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 
complementary strand through base pairing under defined hybridization conditions. Specific
hybridization is an indication that two nucleic acid sequences share a high degree of identity. Specific
hybridization complexes form under permissive annealing conditions and remain hybridized after the
"washing" step(s). The washing step(s) is particularly important in determining the stringency of the
hybridization process, with more stringent conditions allowing less non-specific binding, i.e., binding
between pairs of nucleic acid strands that are not perfectly matched. Permissive conditions for
annealing of nucleic acid sequences are routinely determinable by one of ordinary skill in the art and
may be consistent among hybridization experiments, whereas wash conditions may be varied among
experiments to achieve the desired stringency, and therefore hybridization specificity. Permissive
annealing conditions occur, for example, at 68°C in the presence of about 6 x SSC, about 1% (w/v)
SDS, and about 100 µg/ml denatured salmon sperm DNA.

Generally, stringency of hybridization is expressed, in part, with reference to the temperature
under which the wash step is carried out. Generally, such wash temperatures are selected to be about
5°C to 20°C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic
strength and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50% of the
target sequence hybridizes to a perfectly matched probe. An equation for calculating Tm and conditions
for nucleic acid hybridization are well known and can be found in Sambrook et al., 1989, Molecular
Cloning: A Laboratory Manual, 2nd ed., vol. 1-3, Cold Spring Harbor Press, Plainview NY; specifically
see volume 2, chapter 9.
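
The relationship between Tm and wash temperature stated above is simple arithmetic, sketched here in Python. The Tm value in the example (85°C) is a hypothetical input chosen for illustration; in practice Tm would be calculated or measured for the specific sequence, ionic strength, and pH. The function names are illustrative.

```python
from typing import Tuple


def wash_temperature_range(tm_celsius: float) -> Tuple[float, float]:
    """Return (lowest, highest) wash temperature: about 20 to 5 degrees C below Tm."""
    return (tm_celsius - 20.0, tm_celsius - 5.0)


def is_in_wash_range(wash_celsius: float, tm_celsius: float) -> bool:
    """True if a proposed wash temperature falls in the window below Tm."""
    low, high = wash_temperature_range(tm_celsius)
    return low <= wash_celsius <= high


# Hypothetical Tm of 85 degrees C for a given probe and buffer.
print(wash_temperature_range(85.0))  # (65.0, 80.0)
print(is_in_wash_range(68.0, 85.0))  # True
```

Because the text says "about" 5°C to 20°C, the limits should be read as approximate rather than as hard cutoffs.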

High stringency conditions for hybridization between polynucleotides of the present invention
include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1% SDS, for 1 hour.
Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC concentration may
be varied from about 0.1 to 2 x SSC, with SDS being present at about 0.1%. Typically, blocking
reagents are used to block non-specific hybridization. Such blocking reagents include, for instance,
denatured salmon sperm DNA at about 100-200 µg/ml. Organic solvent, such as formamide at a
concentration of about 35-50% v/v, may also be used under particular circumstances, such as for
RNA:DNA hybridizations. Useful variations on these wash conditions will be readily apparent to those
of ordinary skill in the art. Hybridization, particularly under high stringency conditions, may be
suggestive of evolutionary similarity between the nucleotides. Such similarity is strongly indicative of a
similar role for the nucleotides and their encoded polypeptides.

The term "hybridization complex" refers to a complex formed between two nucleic acid 
sequences by virtue of the formation of hydrogen bonds between complementary bases. A hybridization
complex may be formed in solution (e.g., C0t or R0t analysis) or formed between one nucleic acid
sequence present in solution and another nucleic acid sequence immobilized on a solid support (e.g.,
paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate to which cells
or their nucleic acids have been fixed).

The words "insertion" and "addition" refer to changes in an amino acid or nucleotide sequence 
resulting in the addition of one or more amino acid residues or nucleotides, respectively.

"Immune response" can refer to conditions associated with inflammation, trauma, immune
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression of
various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect cellular
and systemic defense systems.

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of EXMAD which is
capable of eliciting an immune response when introduced into a living organism, for example, a
mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide fragment of
EXMAD which is useful in any of the antibody production methods disclosed herein or known in the
art.



The term "microarray" refers to an arrangement of distinct polynucleotides on a substrate.

The terms "element" and "array element" in a microarray context, refer to hybridizable
polynucleotides arranged on the surface of a substrate.

The term "modulate" refers to a change in the activity of EXMAD. For example, modulation 
may cause an increase or a decrease in protein activity, binding characteristics, or any other biological,
functional, or immunological properties of EXMAD.

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide,
polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or
synthetic origin which may be single-stranded or double-stranded and may represent the sense or the
antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like material.

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
functional relationship with the second nucleic acid sequence. For instance, a promoter is operably
linked to a coding sequence if the promoter affects the transcription or expression of the coding
sequence. Generally, operably linked DNA sequences may be in close proximity or contiguous and,
where necessary to join two protein coding regions, in the same reading frame.

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 
amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. PNAs
preferentially bind complementary single stranded DNA or RNA and stop transcript elongation, and
may be pegylated to extend their lifespan in the cell.

"Probe" refers to nucleic acid sequences encoding EXMAD, their complements, or fragments 
thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are
isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. Typical
labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. "Primers" are
short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target polynucleotide by
complementary base-pairing. The primer may then be extended along the target DNA strand by a DNA
polymerase enzyme. Primer pairs can be used for amplification (and identification) of a nucleic acid
sequence, e.g., by the polymerase chain reaction (PCR).

Probes and primers as used in the present invention typically comprise at least 15 contiguous
nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also
be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100,
or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers may
be considerably longer than these examples, and it is understood that any length supported by the
specification, including the tables, figures, and Sequence Listing, may be used.

Methods for preparing and using probes and primers are described in the references, for
example Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, 2nd ed., vol. 1-3, Cold
Spring Harbor Press, Plainview NY; Ausubel et al., 1987, Current Protocols in Molecular Biology,
Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis et al., 1990, PCR Protocols, A Guide
to Methods and Applications, Academic Press, San Diego CA. PCR primer pairs can be derived from
a known sequence, for example, by using computer programs intended for that purpose such as Primer
(Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge MA).
15 Oligonucleotides for use as primers are selected using software known in the art for such 

purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 
1 00 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 5,000 
nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer selection 
programs have incorporated additional features for expanded capabilities. For example, the PrimOU 
20 primer selection program (available to the public from the Genome Center at University of Texas Soutl 
West Medical Center, Dallas TX) is capable of choosing specific primers from megabase sequences 
and is thus useful for designing primers on a genome- wide scope. The Primer3 primer selection 
program (available to the public from the Whitehead Institute/MIT Center for Genome Research, 
Cambridge MA) allows the user to input a "mispriming library," in which sequences to avoid as primer
binding sites are user-specified. Primer3 is useful, in particular, for the selection of oligonucleotides for
microarrays. (The source code for the latter two primer selection programs may also be obtained from
their respective sources and modified to meet the user's specific needs.) The PrimeGen program
(available to the public from the UK Human Genome Mapping Project Resource Centre, Cambridge
UK) designs primers based on multiple sequence alignments, thereby allowing selection of primers that
hybridize to either the most conserved or least conserved regions of aligned nucleic acid sequences.
Hence, this program is useful for identification of both unique and conserved oligonucleotides and 
polynucleotide fragments. The oligonucleotides and polynucleotide fragments identified by any of the 
above selection methods are useful in hybridization technologies, for example, as PCR or sequencing 
primers, microarray elements, or specific probes to identify fully or partially complementary
polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection are not limited to
those described above.

A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence 
that is made by an artificial combination of two or more otherwise separated segments of sequence. 
This artificial combination is often accomplished by chemical synthesis or, more commonly, by the
artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques
such as those described in Sambrook, supra. The term recombinant includes nucleic acids that have
been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, a
recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter sequence.
Such a recombinant nucleic acid may be part of a vector that is used, for example, to transform a cell.

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a
vaccinia virus, that could be used to vaccinate a mammal wherein the recombinant nucleic acid is
expressed, inducing a protective immunological response in the mammal. 

An "RNA equivalent," in reference to a DNA sequence, is composed of the same linear 
sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of the 
nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose
instead of deoxyribose.
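
The base substitution in this definition can be illustrated with a minimal Python sketch. The function name `rna_equivalent` and the accepted alphabet (A, C, G, T, plus N for an unknown base) are assumptions made for illustration only; the change from deoxyribose to ribose is chemical and has no counterpart in a text representation.

```python
def rna_equivalent(dna: str) -> str:
    """Return the RNA equivalent of a DNA sequence written 5' to 3'.

    Only the thymine-to-uracil replacement is modeled; the sugar
    backbone is not represented in a plain sequence string.
    """
    dna = dna.upper()
    # Assumed alphabet for this sketch: the four DNA bases plus N (unknown).
    unexpected = set(dna) - set("ACGTN")
    if unexpected:
        raise ValueError(f"unexpected characters: {sorted(unexpected)}")
    return dna.replace("T", "U")


print(rna_equivalent("ATGGCTTAA"))  # AUGGCUUAA
```

The linear order of the nucleotides is unchanged, which is the property the definition relies on.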

The term "sample" is used in its broadest sense. A sample suspected of containing nucleic 
acids encoding EXMAD, or fragments thereof, or EXMAD itself, may comprise a bodily fluid; an
extract from a cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA,
RNA, or cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc.

The terms "specific binding" and "specifically binding" refer to that interaction between a
protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or
synthetic binding composition. The interaction is dependent upon the presence of a particular structure
of the protein, e.g., the antigenic determinant or epitope, recognized by the binding molecule. For
example, if an antibody is specific for epitope "A," the presence of a polypeptide containing the epitope
A, or the presence of free unlabeled A, in a reaction containing free labeled A and the antibody will
reduce the amount of labeled A that binds to the antibody. 

The term "substantially purified" refers to nucleic acid or amino acid sequences that are 
removed from their natural environment and are isolated or separated, and are at least 60% free,
preferably at least 75% free, and most preferably at least 90% free from other components with which
they are naturally associated.

A "substitution" refers to the replacement of one or more amino acids or nucleotides by
different amino acids or nucleotides, respectively.

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters,
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers,
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells,
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound.

"Transformation" describes a process by which exogenous DNA enters and changes a recipient
cell. Transformation may occur under natural or artificial conditions according to various methods well
known in the art, and may rely on any known method for the insertion of foreign nucleic acid sequences
into a prokaryotic or eukaryotic host cell. The method for transformation is selected based on the type
of host cell being transformed and may include, but is not limited to, viral infection, electroporation,
heat shock, lipofection, and particle bombardment. The term "transformed" cells includes stably
transformed cells in which the inserted DNA is capable of replication either as an autonomously 
replicating plasmid or as part of the host chromosome, as well as transiently transformed cells which 
express the inserted DNA or RNA for limited periods of time. 

A "transgenic organism," as used herein, is any organism, including but not limited to 
animals and plants, in which one or more of the cells of the organism contains heterologous nucleic
acid introduced by way of human intervention, such as by transgenic techniques well known in the
art. The nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor
of the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with
a recombinant virus. The term genetic manipulation does not include classical cross-breeding, or in
vitro fertilization, but rather is directed to the introduction of a recombinant DNA molecule. The
transgenic organisms contemplated in accordance with the present invention include bacteria,
cyanobacteria, fungi, and plants and animals. The isolated DNA of the present invention can be
introduced into the host by methods known in the art, for example infection, transfection,
transformation or transconjugation. Techniques for transferring the DNA of the present invention
into such organisms are widely known and provided in references such as Sambrook et al. (1989),
supra.

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having at
least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of the
nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999)
set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at least
60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 95% or at least 98% or greater
sequence identity over a certain defined length. A variant may be described as, for example, an "allelic"
(as defined above), "splice," "species," or "polymorphic" variant. A splice variant may have significant
identity to a reference molecule, but will generally have a greater or lesser number of polynucleotides
due to alternate splicing of exons during mRNA processing. The corresponding polypeptide may
possess additional functional domains or lack domains that are present in the reference molecule.
Species variants are polynucleotide sequences that vary from one species to another. The resulting
polypeptides generally will have significant amino acid identity relative to each other. A polymorphic
variant is a variation in the polynucleotide sequence of a particular gene between individuals of a given
species. Polymorphic variants also may encompass "single nucleotide polymorphisms" (SNPs) in
which the polynucleotide sequence varies by one nucleotide base. The presence of SNPs may be 
indicative of, for example, a certain population, a disease state, or a propensity for a disease state. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having at 
least 40% sequence identity to the particular polypeptide sequence over a certain length of one of the 
polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999)
set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at least
60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 98% or greater sequence
identity over a certain defined length of one of the polypeptides. 
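
The identity thresholds in the two "variant" definitions can be made concrete with a short Python sketch. It is not the BLAST 2 Sequences tool and performs no alignment: it assumes the two sequences have already been aligned to equal length, with "-" marking gaps, and simply counts matching columns. The function names and the 40% default threshold argument are illustrative assumptions.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over the columns of a precomputed pairwise alignment."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns where both sequences carry a gap.
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("alignment has no comparable columns")
    # A column counts as identical only if both residues match and neither is a gap.
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 40.0) -> bool:
    """True if the aligned pair reaches the given percent identity."""
    return percent_identity(aligned_a, aligned_b) >= threshold


print(percent_identity("ACGTACGT", "ACGTTCGT"))  # 87.5
print(percent_identity("ACG-T", "ACGAT"))        # 80.0
```

In practice the alignment itself, and therefore the reported identity, depends on the alignment program and its parameters, which is why the definitions name a specific tool, version, and default settings.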
THE INVENTION

The invention is based on the discovery of new human extracellular matrix and
adhesion-associated proteins (EXMAD), the polynucleotides encoding EXMAD, and the use of these
compositions for the diagnosis, treatment, or prevention of cell proliferative, immune, reproductive, 
neuronal, and genetic disorders. 

Table 1 lists the Incyte clones used to assemble full length nucleotide sequences encoding 
EXMAD. Columns 1 and 2 show the sequence identification numbers (SEQ ID NOs) of the
polypeptide and nucleotide sequences, respectively. Column 3 shows the clone IDs of the Incyte clones
in which nucleic acids encoding each EXMAD were identified, and column 4 shows the cDNA libraries
from which these clones were isolated. Column 5 shows Incyte clones and their corresponding cDNA
libraries. Clones for which cDNA libraries are not indicated were derived from pooled cDNA libraries.
In some cases, GenBank sequence identifiers are also shown in column 5. The Incyte clones and
GenBank cDNA sequences, where indicated, in column 5 were used to assemble the consensus 
nucleotide sequence of each EXMAD and are useful as fragments in hybridization technologies. 

The columns of Table 2 show various properties of each of the polypeptides of the invention: 
column 1 references the SEQ ID NO; column 2 shows the number of amino acid residues in each 
polypeptide; column 3 shows potential phosphorylation sites; column 4 shows potential glycosylation
sites; column 5 shows the amino acid residues comprising signature sequences and motifs; column 6
shows homologous sequences as identified by BLAST analysis; and column 7 shows analytical methods
and in some cases, searchable databases to which the analytical methods were applied. The methods of
column 7 were used to characterize each polypeptide through sequence homology and protein motifs.

The columns of Table 3 show the tissue-specificity and diseases, disorders, or conditions
associated with nucleotide sequences encoding EXMAD. The first column of Table 3 lists the
nucleotide SEQ ID NOs. Column 2 lists fragments of the nucleotide sequences of column 1. These
fragments are useful, for example, in hybridization or amplification technologies to identify SEQ ID
NO:26-50 and to distinguish between SEQ ID NO:26-50 and related polynucleotide sequences. The
polypeptides encoded by these fragments are useful, for example, as immunogenic peptides. Column 3
lists tissue categories which express EXMAD as a fraction of total tissues expressing EXMAD.
Column 4 lists diseases, disorders, or conditions associated with those tissues expressing EXMAD as a
fraction of total tissues expressing EXMAD. Column 5 lists the vectors used to subclone each cDNA
library.

The columns of Table 4 show descriptions of the tissues used to construct the cDNA libraries
from which cDNA clones encoding EXMAD were isolated. Column 1 references the nucleotide SEQ
ID NOs, column 2 shows the cDNA libraries from which these clones were isolated, and column 3 
shows the tissue origins and other descriptive information relevant to the cDNA libraries in column 2. 

SEQ ID NO:42 maps to chromosome 8 within the interval from 64.6.0 to 90.20 centiMorgans. 

SEQ ID NO:48 maps to chromosome 2 within the interval from 193.60 to 197.60 centiMorgans.

The invention also encompasses EXMAD variants. A preferred EXMAD variant is one which 
has at least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid 
sequence identity to the EXMAD amino acid sequence, and which contains at least one functional or 
structural characteristic of EXMAD. 

The invention also encompasses polynucleotides which encode EXMAD. In a particular
embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected from
the group consisting of SEQ ID NO:26-50, which encodes EXMAD. The polynucleotide sequences of
SEQ ID NO:26-50, as presented in the Sequence Listing, embrace the equivalent RNA sequences,
wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the sugar backbone
is composed of ribose instead of deoxyribose.

The invention also encompasses a variant of a polynucleotide sequence encoding EXMAD. In 
particular, such a variant polynucleotide sequence will have at least about 80%, or alternatively at least 
about 90%, or even at least about 95% polynucleotide sequence identity to the polynucleotide sequence 
encoding EXMAD. A particular aspect of the invention encompasses a variant of a polynucleotide
sequence comprising a sequence selected from the group consisting of SEQ ID NO:26-50 which has at
least about 80%, or alternatively at least about 90%, or even at least about 95% polynucleotide
sequence identity to a nucleic acid sequence selected from the group consisting of SEQ ID NO:26-50.
Any one of the polynucleotide variants described above can encode an amino acid sequence which
contains at least one functional or structural characteristic of EXMAD.

It will be appreciated by those skilled in the art that as a result of the degeneracy of the genetic
code, a multitude of polynucleotide sequences encoding EXMAD, some bearing minimal similarity to
the polynucleotide sequences of any known and naturally occurring gene, may be produced. Thus, the
invention contemplates each and every possible variation of polynucleotide sequence that could be made
by selecting combinations based on possible codon choices. These combinations are made in
accordance with the standard triplet genetic code as applied to the polynucleotide sequence of naturally
occurring EXMAD, and all such variations are to be considered as being specifically disclosed. 
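
The scale of this "multitude" can be estimated with a brief Python sketch. Under the standard genetic code each amino acid is specified by between one and six codons, so the number of distinct coding sequences for a peptide is the product of the per-residue codon counts. The function name and the example peptides below are hypothetical and are not taken from the Sequence Listing; stop codons are left out of the count.

```python
from math import prod

# Number of codons per amino acid in the standard genetic code
# (one-letter amino acid symbols; the 61 sense codons in total).
CODON_COUNTS = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def count_coding_sequences(peptide: str) -> int:
    """Count the nucleotide sequences that encode the given peptide."""
    try:
        return prod(CODON_COUNTS[aa] for aa in peptide.upper())
    except KeyError as err:
        raise ValueError(f"unknown amino acid symbol: {err.args[0]}") from None


print(count_coding_sequences("GPP"))  # 64
print(count_coding_sequences("LSR"))  # 216
```

Because the count grows multiplicatively with length, even a short polypeptide has far more possible coding sequences than could be listed individually.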

Although nucleotide sequences which encode EXMAD and its variants are generally capable of 
hybridizing to the nucleotide sequence of the naturally occurring EXMAD under appropriately selected
conditions of stringency, it may be advantageous to produce nucleotide sequences encoding EXMAD or
its derivatives possessing a substantially different codon usage, e.g., inclusion of non-naturally
occurring codons. Codons may be selected to increase the rate at which expression of the peptide
occurs in a particular prokaryotic or eukaryotic host in accordance with the frequency with which
particular codons are utilized by the host. Other reasons for substantially altering the nucleotide
sequence encoding EXMAD and its derivatives without altering the encoded amino acid sequences
include the production of RNA transcripts having more desirable properties, such as a greater half-life,
than transcripts produced from the naturally occurring sequence. 

The invention also encompasses production of DNA sequences which encode EXMAD and 
EXMAD derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 
synthetic sequence may be inserted into any of the many available expression vectors and cell systems
using reagents well known in the art. Moreover, synthetic chemistry may be used to introduce 
mutations into a sequence encoding EXMAD or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of 
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 
NO:26-50 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. and
S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol.
152:507-511.) Hybridization conditions, including annealing and wash conditions, are described in
"Definitions."

Methods for DNA sequencing are well known in the art and may be used to practice any of the 
embodiments of the invention. The methods may employ such enzymes as the Klenow fragment of
DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (Perkin-Elmer),
thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway NJ), or combinations of
polymerases and proofreading exonucleases such as those found in the ELONGASE amplification
system (Life Technologies, Gaithersburg MD). Preferably, sequence preparation is automated with
machines such as the MICROLAB 2200 liquid transfer system (Hamilton, Reno NV), PTC200 thermal
cycler (MJ Research, Watertown MA) and ABI CATALYST 800 thermal cycler (Perkin-Elmer).
Sequencing is then carried out using either the ABI 373 or 377 DNA sequencing system
(Perkin-Elmer), the MEGABACE 1000 DNA sequencing system (Molecular Dynamics, Sunnyvale CA), or
other systems known in the art. The resulting sequences are analyzed using a variety of algorithms
which are well known in the art. (See, e.g., Ausubel, F.M. (1997) Short Protocols in Molecular
Biology, John Wiley & Sons, New York NY, unit 7.7; Meyers, R.A. (1995) Molecular Biology and
Biotechnology, Wiley VCH, New York NY, pp. 856-853.)

The nucleic acid sequences encoding EXMAD may be extended utilizing a partial nucleotide 
sequence and employing various PCR-based methods known in the art to detect upstream sequences,
such as promoters and regulatory elements. For example, one method which may be employed,
restriction-site PCR, uses universal and nested primers to amplify unknown sequence from genomic
DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 2:318-322.)
Another method, inverse PCR, uses primers that extend in divergent directions to amplify unknown
sequence from a circularized template. The template is derived from restriction fragments comprising a
known genomic locus and surrounding sequences. (See, e.g., Triglia, T. et al. (1988) Nucleic Acids
Res. 16:8186.) A third method, capture PCR, involves PCR amplification of DNA fragments adjacent
to known sequences in human and yeast artificial chromosome DNA. (See, e.g., Lagerstrom, M. et al.
(1991) PCR Methods Applic. 1:111-119.) In this method, multiple restriction enzyme digestions and
ligations may be used to insert an engineered double-stranded sequence into a region of unknown
sequence before performing PCR. Other methods which may be used to retrieve unknown sequences
are known in the art. (See, e.g., Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3055-3060.)
Additionally, one may use PCR, nested primers, and PROMOTERFINDER libraries (Clontech, Palo
Alto CA) to walk genomic DNA. This procedure avoids the need to screen libraries and is useful in
finding intron/exon junctions. For all PCR-based methods, primers may be designed using
commercially available software, such as OLIGO 4.06 Primer Analysis software (National Biosciences,
Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in length, to have a
GC content of about 50% or more, and to anneal to the template at temperatures of about 68°C to
72°C.
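
Two of the primer criteria stated above, length and GC content, lend themselves to a simple check, sketched here in Python. This is not the OLIGO 4.06 software or any other named program; the function names and hard cutoffs are assumptions for illustration, and the text's "about" means real designs treat these values as approximate. Annealing temperature is deliberately not evaluated, because it depends on a melting-temperature model and reaction conditions that the text does not specify.

```python
def gc_content(primer: str) -> float:
    """Return the percentage of G and C bases in a primer sequence."""
    primer = primer.upper()
    if not primer or set(primer) - set("ACGT"):
        raise ValueError("primer must be a non-empty string of A, C, G, T")
    return 100.0 * (primer.count("G") + primer.count("C")) / len(primer)


def meets_primer_guidelines(primer: str, min_len: int = 22,
                            max_len: int = 30, min_gc: float = 50.0) -> bool:
    """Check the length (22 to 30 nt) and GC content (50% or more) criteria."""
    return min_len <= len(primer) <= max_len and gc_content(primer) >= min_gc


# Hypothetical 24-nucleotide primer, used only to exercise the checks.
candidate = "GCGTACCGTAGGCTAGCTAGGCAT"
print(len(candidate), meets_primer_guidelines(candidate))
```

A candidate that passes these two checks would still need to be assessed for annealing temperature, self-complementarity, and mispriming with a dedicated primer design program.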

When screening for full-length cDNAs, it is preferable to use libraries that have been
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) library
does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence into 5'
non-transcribed regulatory regions.

Capillary electrophoresis systems which are commercially available may be used to analyze the
size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary
sequencing may employ flowable polymers for electrophoretic separation, four different
nucleotide-specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the
emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Perkin-Elmer), and the entire process
from loading of samples to computer analysis and electronic data display may be computer controlled.
Capillary electrophoresis is especially preferable for sequencing small DNA fragments which may be
present in limited amounts in a particular sample.

In another embodiment of the invention, polynucleotide sequences or fragments thereof which
encode EXMAD may be cloned in recombinant DNA molecules that direct expression of EXMAD, or
fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent degeneracy of
the genetic code, other DNA sequences which encode substantially the same or a functionally equivalent
amino acid sequence may be produced and used to express EXMAD.

The nucleotide sequences of the present invention can be engineered using methods generally
known in the art in order to alter EXMAD-encoding sequences for a variety of purposes including, but
not limited to, modification of the cloning, processing, and/or expression of the gene product. DNA
shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic
oligonucleotides may be used to engineer the nucleotide sequences. For example,
oligonucleotide-mediated site-directed mutagenesis may be used to introduce mutations that create new restriction sites,
alter glycosylation patterns, change codon preference, produce splice variants, and so forth.

The nucleotides of the present invention may be subjected to DNA shuffling techniques such 
as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number
5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat.
Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or
improve the biological properties of EXMAD, such as its biological or enzymatic activity or its
ability to bind to other molecules or compounds. DNA shuffling is a process by which a library of
gene variants is produced using PCR-mediated recombination of gene fragments. The library is then
subjected to selection or screening procedures that identify those gene variants with the desired
properties. These preferred variants may then be pooled and further subjected to recursive rounds of
DNA shuffling and selection/screening. Thus, genetic diversity is created through "artificial"
breeding and rapid molecular evolution. For example, fragments of a single gene containing random
point mutations may be recombined, screened, and then reshuffled until the desired properties are
optimized. Alternatively, fragments of a given gene may be recombined with fragments of
homologous genes in the same gene family, either from the same or different species, thereby
maximizing the genetic diversity of multiple naturally occurring genes in a directed and controllable
manner. 

In another embodiment, sequences encoding EXMAD may be synthesized, in whole or in part, 
using chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al. (1980) Nucleic Acids 
Symp. Ser. 7:215-223; and Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232.) Alternatively,
EXMAD itself or a fragment thereof may be synthesized using chemical methods. For example,
peptide synthesis can be performed using various solid-phase techniques. (See, e.g., Roberge, J.Y. et al.
(1995) Science 269:202-204.) Automated synthesis may be achieved using the ABI 431A peptide
synthesizer (Perkin-Elmer). Additionally, the amino acid sequence of EXMAD, or any part thereof,
may be altered during direct synthesis and/or combined with sequences from other proteins, or any part
thereof, to produce a variant polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 
chromatography. (See, e.g., Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421.)
The composition of the synthetic peptides may be confirmed by amino acid analysis or by sequencing.
(See, e.g., Creighton, T. (1984) Proteins, Structures and Molecular Properties, WH Freeman, New
York NY.)

In order to express a biologically active EXMAD, the nucleotide sequences encoding EXMAD
or derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which
contains the necessary elements for transcriptional and translational control of the inserted coding
sequence in a suitable host. These elements include regulatory sequences, such as enhancers,
constitutive and inducible promoters, and 5' and 3' untranslated regions in the vector and in
polynucleotide sequences encoding EXMAD. Such elements may vary in their strength and specificity.
Specific initiation signals may also be used to achieve more efficient translation of sequences encoding
EXMAD. Such signals include the ATG initiation codon and adjacent sequences, e.g. the Kozak
sequence. In cases where sequences encoding EXMAD and its initiation codon and upstream
regulatory sequences are inserted into the appropriate expression vector, no additional transcriptional or
translational control signals may be needed. However, in cases where only coding sequence, or a
fragment thereof, is inserted, exogenous translational control signals including an in-frame ATG
initiation codon should be provided by the vector. Exogenous translational elements and initiation
codons may be of various origins, both natural and synthetic. The efficiency of expression may be
enhanced by the inclusion of enhancers appropriate for the particular host cell system used. (See, e.g.,
Scharf, D. et al. (1994) Results Probl. Cell Differ. 20:125-162.)

Methods which are well known to those skilled in the art may be used to construct expression 
vectors containing sequences encoding EXMAD and appropriate transcriptional and translational
control elements. These methods include in vitro recombinant DNA techniques, synthetic techniques,
and in vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) Molecular Cloning, A
Laboratory Manual, Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 16-17; Ausubel, F.M. et
al. (1995) Current Protocols in Molecular Biology, John Wiley & Sons, New York NY, ch. 9, 13, and
16.)

A variety of expression vector/host systems may be utilized to contain and express sequences 
encoding EXMAD. These include, but are not limited to, microorganisms such as bacteria transformed
with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with
yeast expression vectors; insect cell systems infected with viral expression vectors (e.g., baculovirus);
plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, or
tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or
animal cell systems. The invention is not limited by the host cell employed. 

In bacterial systems, a number of cloning and expression vectors may be selected depending 
upon the use intended for polynucleotide sequences encoding EXMAD. For example, routine cloning, 
subcloning, and propagation of polynucleotide sequences encoding EXMAD can be achieved using a
multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT1 plasmid
(Life Technologies). Ligation of sequences encoding EXMAD into the vector's multiple cloning site
disrupts the lacZ gene, allowing a colorimetric screening procedure for identification of transformed
bacteria containing recombinant molecules. In addition, these vectors may be useful for in vitro
transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of nested
deletions in the cloned sequence. (See, e.g., Van Heeke, G. and S.M. Schuster (1989) J. Biol. Chem.
264:5503-5509.) When large quantities of EXMAD are needed, e.g. for the production of antibodies,
vectors which direct high level expression of EXMAD may be used. For example, vectors containing
the strong, inducible T5 or T7 bacteriophage promoter may be used.

Yeast expression systems may be used for production of EXMAD. A number of vectors
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH
promoters, may be used in the yeast Saccharomyces cerevisiae or Pichia pastoris. In addition, such
vectors direct either the secretion or intracellular retention of expressed proteins and enable integration
of foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 1995, supra;
Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; and Scorer, C.A. et al. (1994)
Bio/Technology 12:181-184.)

Plant systems may also be used for expression of EXMAD. Transcription of sequences 
encoding EXMAD may be driven by viral promoters, e.g., the 35S and 19S promoters of CaMV used
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J.
6:307-311). Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock
promoters may be used. (See, e.g., Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al.
(1984) Science 224:838-843; and Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105.) These
constructs can be introduced into plant cells by direct DNA transformation or pathogen-mediated
transfection. (See, e.g., The McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill,
New York NY, pp. 191-196.)

In mammalian cells, a number of viral-based expression systems may be utilized. In cases
where an adenovirus is used as an expression vector, sequences encoding EXMAD may be ligated into
an adenovirus transcription/translation complex consisting of the late promoter and tripartite leader
sequence. Insertion in a non-essential E1 or E3 region of the viral genome may be used to obtain
infective virus which expresses EXMAD in host cells. (See, e.g., Logan, J. and T. Shenk (1984) Proc.
Natl. Acad. Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma
virus (RSV) enhancer, may be used to increase expression in mammalian host cells. SV40 or EBV-
based vectors may also be used for high-level protein expression.

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of
DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are
constructed and delivered via conventional delivery methods (liposomes, polycationic amino polymers,
or vesicles) for therapeutic purposes. (See, e.g., Harrington, J.J. et al. (1997) Nat. Genet. 15:345-355.)

For long-term production of recombinant proteins in mammalian systems, stable expression of
EXMAD in cell lines is preferred. For example, sequences encoding EXMAD can be transformed into
cell lines using expression vectors which may contain viral origins of replication and/or endogenous
expression elements and a selectable marker gene on the same or on a separate vector. Following the
introduction of the vector, cells may be allowed to grow for about 1 to 2 days in enriched media before
being switched to selective media. The purpose of the selectable marker is to confer resistance to a
selective agent, and its presence allows growth and recovery of cells which successfully express the
introduced sequences. Resistant clones of stably transformed cells may be propagated using tissue
culture techniques appropriate to the cell type.

Any number of selection systems may be used to recover transformed cell lines. These include,
but are not limited to, the herpes simplex virus thymidine kinase and adenine phosphoribosyltransferase
genes, for use in tk⁻ and apr⁻ cells, respectively. (See, e.g., Wigler, M. et al. (1977) Cell 11:223-232;
Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, or herbicide resistance can be
used as the basis for selection. For example, dhfr confers resistance to methotrexate; neo confers
resistance to the aminoglycosides neomycin and G-418; and als and pat confer resistance to
chlorsulfuron and phosphinothricin, respectively (pat encodes phosphinothricin acetyltransferase).
(See, e.g., Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F.
et al. (1981) J. Mol. Biol. 150:1-14.)
Additional selectable genes have been described, e.g., trpB and hisD, which alter cellular requirements
for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc. Natl. Acad. Sci. USA
85:8047-8051.) Visible markers, e.g., anthocyanins, green fluorescent proteins (GFP; Clontech),
β-glucuronidase and its substrate β-glucuronide, or luciferase and its substrate luciferin may be used.
These markers can be used not only to identify transformants, but also to quantify the amount of
transient or stable protein expression attributable to a specific vector system. (See, e.g., Rhodes, C.A.
(1995) Methods Mol. Biol. 55:121-131.)

Although the presence/absence of marker gene expression suggests that the gene of interest is
also present, the presence and expression of the gene may need to be confirmed. For example, if the
sequence encoding EXMAD is inserted within a marker gene sequence, transformed cells containing
sequences encoding EXMAD can be identified by the absence of marker gene function. Alternatively, a
marker gene can be placed in tandem with a sequence encoding EXMAD under the control of a single
promoter. Expression of the marker gene in response to induction or selection usually indicates
expression of the tandem gene as well.

In general, host cells that contain the nucleic acid sequence encoding EXMAD and that express
EXMAD may be identified by a variety of procedures known to those of skill in the art. These
procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR
amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or
chip based technologies for the detection and/or quantification of nucleic acid or protein sequences.

Immunological methods for detecting and measuring the expression of EXMAD using either
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques include
enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and fluorescence
activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal
antibodies reactive to two non-interfering epitopes on EXMAD is preferred, but a competitive binding
assay may be employed. These and other assays are well known in the art. (See, e.g., Hampton, R. et
al. (1990) Serological Methods, a Laboratory Manual, APS Press, St. Paul MN, Sect. IV; Coligan, J.E.
et al. (1997) Current Protocols in Immunology, Greene Pub. Associates and Wiley-Interscience, New
York NY; and Pound, J.D. (1998) Immunochemical Protocols, Humana Press, Totowa NJ.)

A wide variety of labels and conjugation techniques are known by those skilled in the art and
may be used in various nucleic acid and amino acid assays. Means for producing labeled hybridization
or PCR probes for detecting sequences related to polynucleotides encoding EXMAD include
oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled nucleotide.
Alternatively, the sequences encoding EXMAD, or any fragments thereof, may be cloned into a vector
for the production of an mRNA probe. Such vectors are known in the art, are commercially available,
and may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase
such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety of
commercially available kits, such as those provided by Amersham Pharmacia Biotech, Promega
(Madison WI), and US Biochemical. Suitable reporter molecules or labels which may be used for ease
of detection include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents, as
well as substrates, cofactors, inhibitors, magnetic particles, and the like.

Host cells transformed with nucleotide sequences encoding EXMAD may be cultured under
conditions suitable for the expression and recovery of the protein from cell culture. The protein
produced by a transformed cell may be secreted or retained intracellularly depending on the sequence
and/or the vector used. As will be understood by those of skill in the art, expression vectors containing
polynucleotides which encode EXMAD may be designed to contain signal sequences which direct
secretion of EXMAD through a prokaryotic or eukaryotic cell membrane.

In addition, a host cell strain may be chosen for its ability to modulate expression of the 
inserted sequences or to process the expressed protein in the desired fashion. Such modifications of the 
polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, phosphorylation, 
lipidation, and acylation. Post-translational processing which cleaves a "prepro" or "pro" form of the 
protein may also be used to specify protein targeting, folding, and/or activity. Different host cells 
which have specific cellular machinery and characteristic mechanisms for post-translational activities 
(e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the American Type Culture
Collection (ATCC, Manassas VA) and may be chosen to ensure the correct modification and processing 
of the foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant nucleic acid
sequences encoding EXMAD may be ligated to a heterologous sequence resulting in translation of a
fusion protein in any of the aforementioned host systems. For example, a chimeric EXMAD protein
containing a heterologous moiety that can be recognized by a commercially available antibody may
facilitate the screening of peptide libraries for inhibitors of EXMAD activity. Heterologous protein and
peptide moieties may also facilitate purification of fusion proteins using commercially available affinity
matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), maltose
binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, c-myc, and
hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their cognate fusion
proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and metal-chelate resins,
respectively. FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity purification of fusion
proteins using commercially available monoclonal and polyclonal antibodies that specifically recognize
these epitope tags. A fusion protein may also be engineered to contain a proteolytic cleavage site
located between the EXMAD encoding sequence and the heterologous protein sequence, so that
EXMAD may be cleaved away from the heterologous moiety following purification. Methods for
fusion protein expression and purification are discussed in Ausubel (1995, supra, ch. 10). A variety of
commercially available kits may also be used to facilitate expression and purification of fusion proteins.
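
The tag-to-matrix pairings described above can be summarized as a simple lookup. The following Python sketch is illustrative only; the names AFFINITY_MATRIX, EPITOPE_TAGS, and purification_route are hypothetical and do not correspond to any kit or library.

```python
# Tag-to-matrix pairings as listed in the paragraph above.
AFFINITY_MATRIX = {
    "GST": "glutathione",
    "MBP": "maltose",
    "Trx": "phenylarsine oxide",
    "CBP": "calmodulin",
    "6-His": "metal-chelate resin",
}

# Epitope tags purified with tag-specific antibodies rather than a ligand resin.
EPITOPE_TAGS = frozenset({"FLAG", "c-myc", "HA"})


def purification_route(tag: str) -> str:
    """Describe how a fusion protein carrying `tag` would be purified."""
    if tag in AFFINITY_MATRIX:
        return f"affinity purification on immobilized {AFFINITY_MATRIX[tag]}"
    if tag in EPITOPE_TAGS:
        return f"immunoaffinity purification with an anti-{tag} antibody"
    raise ValueError(f"tag not listed in the text: {tag!r}")


if __name__ == "__main__":
    for tag in ("GST", "6-His", "FLAG"):
        print(tag, "->", purification_route(tag))
```

Keeping the ligand-based tags and the epitope tags in separate collections mirrors the distinction the text draws between affinity and immunoaffinity purification.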

In a further embodiment of the invention, synthesis of radiolabeled EXMAD may be achieved
in vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These
systems couple transcription and translation of protein-coding sequences operably associated with the
T7, T3, or SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid
precursor, for example, 35S-methionine.

Fragments of EXMAD may be produced not only by recombinant means, but also by direct
peptide synthesis using solid-phase techniques. (See, e.g., Creighton, supra, pp. 55-60.) Protein
synthesis may be performed by manual techniques or by automation. Automated synthesis may be
achieved, for example, using the ABI 431A peptide synthesizer (Perkin-Elmer). Various fragments of
EXMAD may be synthesized separately and then combined to produce the full length molecule.
THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists
between regions of EXMAD and extracellular matrix and adhesion-associated proteins. In addition,
the expression of EXMAD is closely associated with cancerous, proliferating, inflamed, nervous,
reproductive, urologic, hematopoietic/immune, cardiovascular, musculoskeletal, developmental, and
gastrointestinal tissues, and with cell proliferative disorders, including cancer, inflammation and the
immune response. Therefore, EXMAD appears to play a role in cell proliferative, immune,
reproductive, neuronal, and genetic disorders. In the treatment of disorders associated with increased
EXMAD expression or activity, it is desirable to decrease the expression or activity of EXMAD. In
the treatment of disorders associated with decreased EXMAD expression or activity, it is desirable to
increase the expression or activity of EXMAD.

Therefore, in one embodiment, EXMAD or a fragment or derivative thereof may be
administered to a subject to treat or prevent a disorder associated with decreased expression or
activity of EXMAD. Examples of such disorders include, but are not limited to, a cell proliferative
disorder, such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed
connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria,
polycythemia vera, psoriasis, primary thrombocythemia, and cancers including adenocarcinoma,
leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, a cancer of
the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall bladder, ganglia,
gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate,
salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; an immune disorder, such as
inflammation, actinic keratosis, acquired immunodeficiency syndrome (AIDS), Addison's disease,
adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia,
arteriosclerosis, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis,
autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, bursitis,
cholecystitis, cirrhosis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis,
diabetes mellitus, emphysema, erythroblastosis fetalis, erythema nodosum, atrophic gastritis,
glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's thyroiditis,
paroxysmal nocturnal hemoglobinuria, hepatitis, hypereosinophilia, irritable bowel syndrome,
episodic lymphopenia with lymphocytotoxins, mixed connective tissue disease (MCTD), multiple
sclerosis, myasthenia gravis, myocardial or pericardial inflammation, myelofibrosis, osteoarthritis,
osteoporosis, pancreatitis, polycythemia vera, polymyositis, psoriasis, Reiter's syndrome, rheumatoid
arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus,
systemic sclerosis, primary thrombocythemia, thrombocytopenic purpura, ulcerative colitis, uveitis,
Werner syndrome, complications of cancer, hemodialysis, and extracorporeal circulation, viral,
bacterial, fungal, parasitic, protozoal, and helminthic infections, trauma, and hematopoietic cancer
including lymphoma, leukemia, and myeloma; a reproductive disorder, such as a disorder of prolactin
production, infertility, including tubal disease, ovulatory defects, and endometriosis, a disruption of
the estrous cycle, a disruption of the menstrual cycle, polycystic ovary syndrome, ovarian
hyperstimulation syndrome, an endometrial or ovarian tumor, a uterine fibroid, autoimmune
disorders, an ectopic pregnancy, and teratogenesis; cancer of the breast, fibrocystic breast disease, and
galactorrhea; a disruption of spermatogenesis, abnormal sperm physiology, cancer of the testis, cancer
of the prostate, benign prostatic hyperplasia, prostatitis, Peyronie's disease, impotence, carcinoma of
the male breast, and gynecomastia; a neuronal disorder, such as akathisia, Alzheimer's disease,
amnesia, amyotrophic lateral sclerosis, bipolar disorder, catatonia, cerebral neoplasms, dementia,
depression, diabetic neuropathy, Down's syndrome, tardive dyskinesia, dystonias, epilepsy,
Huntington's disease, peripheral neuropathy, multiple sclerosis, neurofibromatosis, Parkinson's
disease, paranoid psychoses, postherpetic neuralgia, schizophrenia, and Tourette's disorder; and a
genetic disorder, such as adrenoleukodystrophy, Alport's syndrome, choroideremia, Duchenne and
Becker muscular dystrophy, Down's syndrome, cystic fibrosis, chronic granulomatous disease,
dentinogenesis imperfecta type II, dentin dysplasia type II, Gaucher's disease, Huntington's chorea,
Marfan's syndrome, muscular dystrophy, myotonic dystrophy, pycnodysostosis, Refsum's syndrome,
retinoblastoma, sickle cell anemia, thalassemia, Werner syndrome, von Willebrand's disease, Wilms'
tumor, Zellweger syndrome, peroxisomal acyl-CoA oxidase deficiency, peroxisomal thiolase
deficiency, peroxisomal bifunctional protein deficiency, mitochondrial carnitine palmitoyl transferase
and carnitine deficiency, mitochondrial very-long-chain acyl-CoA dehydrogenase deficiency,
mitochondrial medium-chain acyl-CoA dehydrogenase deficiency, mitochondrial short-chain acyl-
CoA dehydrogenase deficiency, mitochondrial electron transport flavoprotein and electron transport
flavoprotein:ubiquinone oxidoreductase deficiency, mitochondrial trifunctional protein deficiency,
and mitochondrial short-chain 3-hydroxyacyl-CoA dehydrogenase deficiency.

In another embodiment, a vector capable of expressing EXMAD or a fragment or derivative 
thereof may be administered to a subject to treat or prevent a disorder associated with decreased 
expression or activity of EXMAD including, but not limited to, those described above. 

In a further embodiment, a pharmaceutical composition comprising a substantially purified
EXMAD in conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat
or prevent a disorder associated with decreased expression or activity of EXMAD including, but not
limited to, those provided above.

In still another embodiment, an agonist which modulates the activity of EXMAD may be
administered to a subject to treat or prevent a disorder associated with decreased expression or activity
of EXMAD including, but not limited to, those listed above.

In a further embodiment, an antagonist of EXMAD may be administered to a subject to treat or
prevent a disorder associated with increased expression or activity of EXMAD. Examples of such
disorders include, but are not limited to, those cell proliferative, immune, reproductive, neuronal, and
genetic disorders described above. In one aspect, an antibody which specifically binds EXMAD may be
used directly as an antagonist or indirectly as a targeting or delivery mechanism for bringing a
pharmaceutical agent to cells or tissues which express EXMAD.

In an additional embodiment, a vector expressing the complement of the polynucleotide
encoding EXMAD may be administered to a subject to treat or prevent a disorder associated with
increased expression or activity of EXMAD including, but not limited to, those described above.

In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary
sequences, or vectors of the invention may be administered in combination with other appropriate
therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made by
one of ordinary skill in the art, according to conventional pharmaceutical principles. The combination
of therapeutic agents may act synergistically to effect the treatment or prevention of the various
disorders described above. Using this approach, one may be able to achieve therapeutic efficacy with
lower dosages of each agent, thus reducing the potential for adverse side effects.

An antagonist of EXMAD may be produced using methods which are generally known in the
art. In particular, purified EXMAD may be used to produce antibodies or to screen libraries of
pharmaceutical agents to identify those which specifically bind EXMAD. Antibodies to EXMAD may
also be generated using methods that are well known in the art. Such antibodies may include, but are
not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and
fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which inhibit
dimer formation) are generally preferred for therapeutic use.

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans,
and others may be immunized by injection with EXMAD or with any fragment or oligopeptide thereof
which has immunogenic properties. Depending on the host species, various adjuvants may be used to
increase immunological response. Such adjuvants include, but are not limited to, Freund's, mineral gels
such as aluminum hydroxide, and surface active substances such as lysolecithin, pluronic polyols,
polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among adjuvants used in humans, BCG
(bacilli Calmette-Guerin) and Corynebacterium parvum are especially preferable.

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to
EXMAD have an amino acid sequence consisting of at least about 5 amino acids, and generally will
consist of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or
fragments are identical to a portion of the amino acid sequence of the natural protein and contain the
entire amino acid sequence of a small, naturally occurring molecule. Short stretches of EXMAD amino
acids may be fused with those of another protein, such as KLH, and antibodies to the chimeric molecule
may be produced.
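
As an illustration of the length guideline above, the following Python sketch enumerates every contiguous peptide of a chosen length from a protein sequence. The function name candidate_peptides and the example sequence are hypothetical placeholders, not part of any EXMAD sequence; selection of immunogenic candidates would still require separate analysis.

```python
from typing import List, Tuple


def candidate_peptides(protein: str, length: int = 10) -> List[Tuple[int, str]]:
    """Return (1-based start position, peptide) for each window of `length` residues."""
    if length < 5:
        # The text prefers peptides of at least about 5 residues.
        raise ValueError("peptide length should be at least about 5 residues")
    return [
        (start + 1, protein[start:start + length])
        for start in range(len(protein) - length + 1)
    ]


if __name__ == "__main__":
    example = "MKTLLVAGGSPRQWE"  # hypothetical 15-residue sequence
    for position, peptide in candidate_peptides(example):
        print(position, peptide)
```

Because each window is an exact substring, every candidate is identical to a portion of the input sequence, as the paragraph above prefers.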

Monoclonal antibodies to EXMAD may be prepared using any technique which provides for
the production of antibody molecules by continuous cell lines in culture. These include, but are not
limited to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma
technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J.
Immunol. Methods 81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. USA 80:2026-2030; and
Cole, S.P. et al. (1984) Mol. Cell Biol. 62:109-120.)

In addition, techniques developed for the production of "chimeric antibodies," such as the
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate
antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984) Proc.
Natl. Acad. Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and Takeda,
S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the production of single
chain antibodies may be adapted, using methods known in the art, to produce EXMAD-specific single
chain antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be
generated by chain shuffling from random combinatorial immunoglobulin libraries. (See, e.g., Burton,
D.R. (1991) Proc. Natl. Acad. Sci. USA 88:10134-10137.)

Antibodies may also be produced by inducing in vivo production in the lymphocyte population
or by screening immunoglobulin libraries or panels of highly specific binding reagents as disclosed in
the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA 86:3833-3837; Winter,
G. et al. (1991) Nature 349:293-299.)

Antibody fragments which contain specific binding sites for EXMAD may also be generated.
For example, such fragments include, but are not limited to, F(ab')2 fragments produced by pepsin
digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of the
F(ab')2 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and easy
identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D. et al.
(1989) Science 246:1275-1281.)

Various immunoassays may be used for screening to identify antibodies having the desired
specificity. Numerous protocols for competitive binding or immunoradiometric assays using either
polyclonal or monoclonal antibodies with established specificities are well known in the art. Such
immunoassays typically involve the measurement of complex formation between EXMAD and its
specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive
to two non-interfering EXMAD epitopes is generally used, but a competitive binding assay may also be
employed (Pound, supra).

Various methods such as Scatchard analysis in conjunction with radioimmunoassay techniques
may be used to assess the affinity of antibodies for EXMAD. Affinity is expressed as an association
constant, Ka, which is defined as the molar concentration of EXMAD-antibody complex divided by the
product of the molar concentrations of free antigen and free antibody under equilibrium conditions.
The Ka determined for a preparation of polyclonal antibodies, which are heterogeneous in their
affinities for multiple EXMAD epitopes, represents the average affinity, or avidity, of the antibodies
for EXMAD. The Ka determined for a preparation of monoclonal antibodies, which are monospecific
for a particular EXMAD epitope, represents a true measure of affinity. High-affinity antibody
preparations with Ka ranging from about 10^9 to 10^12 L/mole are preferred for use in immunoassays
in which the EXMAD-antibody complex must withstand rigorous manipulations. Low-affinity antibody
preparations with Ka ranging from about 10^6 to 10^7 L/mole are preferred for use in
immunopurification and similar procedures which ultimately require dissociation of EXMAD,
preferably in active form, from the antibody (Catty, D. (1988) Antibodies, Volume I: A Practical
Approach, IRL Press, Washington, DC; Liddell, J.E. and Cryer, A. (1991) A Practical Guide to
Monoclonal Antibodies, John Wiley & Sons, New York NY).
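
The definition of the association constant above can be made concrete with a short calculation. The following Python sketch is a minimal illustration, assuming a single class of binding sites so that a Scatchard plot (bound/free against bound) is linear with slope equal to the negative of the association constant. The function names and the concentrations in the usage example are hypothetical.

```python
from typing import Sequence


def association_constant(complex_m: float, free_antigen_m: float,
                         free_antibody_m: float) -> float:
    """Ka in L/mole: [complex] / ([free antigen] * [free antibody]) at equilibrium."""
    return complex_m / (free_antigen_m * free_antibody_m)


def scatchard_ka(bound_m: Sequence[float], free_m: Sequence[float]) -> float:
    """Estimate Ka as minus the least-squares slope of bound/free versus bound.

    Assumes one class of binding sites, so that B/F = Ka * (Bmax - B).
    """
    ratios = [b / f for b, f in zip(bound_m, free_m)]
    n = len(bound_m)
    mean_x = sum(bound_m) / n
    mean_y = sum(ratios) / n
    sxx = sum((x - mean_x) ** 2 for x in bound_m)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(bound_m, ratios))
    return -(sxy / sxx)


if __name__ == "__main__":
    # Hypothetical equilibrium concentrations, all in mol/L.
    print(association_constant(1e-9, 1e-9, 1e-9))
```

For a heterogeneous polyclonal preparation the Scatchard plot is generally curved, so a single fitted slope should be read as an average value, consistent with the distinction drawn in the text.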

The titer and avidity of polyclonal antibody preparations may be further evaluated to determine
the quality and suitability of such preparations for certain downstream applications. For example, a
polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, preferably 5-10 mg
specific antibody/ml, is generally employed in procedures requiring precipitation of EXMAD-antibody
complexes. Procedures for evaluating antibody specificity, titer, and avidity, and guidelines for
antibody quality and usage in various applications, are generally available. (See, e.g., Catty, supra, and
Coligan et al., supra.)

In another embodiment of the invention, the polynucleotides encoding EXMAD, or any
fragment or complement thereof, may be used for therapeutic purposes. In one aspect, the complement
of the polynucleotide encoding EXMAD may be used in situations in which it would be desirable to
block the transcription of the mRNA. In particular, cells may be transformed with sequences
complementary to polynucleotides encoding EXMAD. Thus, complementary molecules or fragments
may be used to modulate EXMAD activity, or to achieve regulation of gene function. Such technology
is now well known in the art, and sense or antisense oligonucleotides or larger fragments can be
designed from various locations along the coding or control regions of sequences encoding EXMAD.

Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia viruses, or
from various bacterial plasmids, may be used for delivery of nucleotide sequences to the targeted organ,
tissue, or cell population. Methods which are well known to those skilled in the art can be used to
construct vectors to express nucleic acid sequences complementary to the polynucleotides encoding
EXMAD. (See, e.g., Sambrook, supra; Ausubel, 1995, supra.)

Genes encoding EXMAD can be turned off by transforming a cell or tissue with expression
vectors which express high levels of a polynucleotide, or fragment thereof, encoding EXMAD. Such
constructs may be used to introduce untranslatable sense or antisense sequences into a cell. Even in the
absence of integration into the DNA, such vectors may continue to transcribe RNA molecules until they
are disabled by endogenous nucleases. Transient expression may last for a month or more with a
non-replicating vector, and may last even longer if appropriate replication elements are part of the
vector system.

As mentioned above, modifications of gene expression can be obtained by designing
complementary sequences or antisense molecules (DNA, RNA, or PNA) to the control, 5', or regulatory
regions of the gene encoding EXMAD. Oligonucleotides derived from the transcription initiation site,
e.g., between about positions -10 and +10 from the start site, may be employed. Similarly, inhibition
can be achieved using triple helix base-pairing methodology. Triple helix pairing is useful because it
causes inhibition of the ability of the double helix to open sufficiently for the binding of polymerases,
transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have
been described in the literature. (See, e.g., Gee, J.E. et al. (1994) in Huber, B.E. and B.I. Carr,
Molecular and Immunologic Approaches, Futura Publishing, Mt. Kisco NY, pp. 163-177.) A
complementary sequence or antisense molecule may also be designed to block translation of mRNA by
preventing the transcript from binding to ribosomes.
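
The design of an oligonucleotide spanning about positions -10 to +10 can be sketched as follows. This Python example is illustrative only: the function name antisense_oligo is hypothetical, the input is assumed to be the sense-strand DNA sequence, and tss_index is assumed to be the 0-based index of position +1. Under the usual convention there is no position 0, so the default window is 20 nucleotides long.

```python
# Translation table for complementing an uppercase DNA sequence.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense_oligo(sense_dna: str, tss_index: int,
                    upstream: int = 10, downstream: int = 10) -> str:
    """Return the reverse complement of positions -upstream..+downstream.

    `tss_index` is the 0-based index of position +1 in `sense_dna`.
    """
    sense_dna = sense_dna.upper()
    start = tss_index - upstream
    end = tss_index + downstream
    if start < 0 or end > len(sense_dna):
        raise ValueError("requested window extends past the sequence")
    window = sense_dna[start:end]
    # Complement each base, then reverse so the oligo reads 5' to 3'.
    return window.translate(COMPLEMENT)[::-1]


if __name__ == "__main__":
    # Hypothetical 20-nt region with position +1 at index 10.
    print(antisense_oligo("AAAAAAAAAACCCCCCCCCC", 10))
```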
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Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 
molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, 
engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 
endonucleolytic cleavage of sequences encoding EXMAD.

Specific ribozyme cleavage sites within any potential RNA target are initially identified by
scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA,
GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides,
corresponding to the region of the target gene containing the cleavage site, may be evaluated for
secondary structural features which may render the oligonucleotide inoperable. The suitability of
candidate targets may also be evaluated by testing accessibility to hybridization with complementary
oligonucleotides using ribonuclease protection assays.
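
The scanning step described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the function names are hypothetical, the window is simply centred on the triplet, and no secondary-structure evaluation is attempted; that assessment would require a separate folding tool.

```python
import re
from typing import List, Tuple


def find_cleavage_sites(rna: str) -> List[Tuple[int, str]]:
    """Return (0-based index, triplet) for every GUA, GUU, or GUC in the target."""
    rna = rna.upper().replace("T", "U")  # accept DNA-style input as well
    # A lookahead lets the scan report overlapping matches, should any occur.
    return [(m.start(), m.group(1)) for m in re.finditer(r"(?=(GU[AUC]))", rna)]


def flanking_window(rna: str, site_index: int, length: int = 17) -> str:
    """Return a window of `length` nt (15 to 20) containing the triplet at `site_index`."""
    if not 15 <= length <= 20:
        raise ValueError("window length should be between 15 and 20 nt")
    centre = site_index + 1  # middle base of the triplet
    start = max(0, centre - length // 2)
    end = min(len(rna), start + length)
    start = max(0, end - length)  # shift left if the window ran off the 3' end
    return rna[start:end]


if __name__ == "__main__":
    target = "A" * 20 + "GUC" + "A" * 20  # hypothetical target RNA
    for index, triplet in find_cleavage_sites(target):
        print(index, triplet, flanking_window(target, index))
```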

Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared by
any method known in the art for the synthesis of nucleic acid molecules. These include techniques for
chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis.
Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA sequences
encoding EXMAD. Such DNA sequences may be incorporated into a wide variety of vectors with
suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs that
synthesize complementary RNA, constitutively or inducibly, can be introduced into cell lines, cells, or
tissues.

RNA molecules may be modified to increase intracellular stability and half-life. Possible
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3' ends
of the molecule, or the use of phosphorothioate or 2' O-methyl rather than phosphodiester linkages
within the backbone of the molecule. This concept is inherent in the production of PNAs and can be
extended in all of these molecules by the inclusion of nontraditional bases such as inosine, queuosine, and
wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine,
guanine, thymine, and uridine which are not as easily recognized by endogenous endonucleases.

Many methods for introducing vectors into cells or tissues are available and equally suitable for
use in vivo, in vitro, and ex vivo. For ex vivo therapy, vectors may be introduced into stem cells taken
from the patient and clonally propagated for autologous transplant back into that same patient.
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved
using methods which are well known in the art. (See, e.g., Goldman, C.K. et al. (1997) Nat.
Biotechnol. 15:462-466.)

Any of the therapeutic methods described above may be applied to any subject in need of such
therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and
monkeys.

An additional embodiment of the invention relates to the administration of a pharmaceutical or
sterile composition, in conjunction with a pharmaceutically acceptable carrier, for any of the therapeutic
effects discussed above. Such pharmaceutical compositions may consist of EXMAD, antibodies to
EXMAD, and mimetics, agonists, antagonists, or inhibitors of EXMAD. The compositions may be
administered alone or in combination with at least one other agent, such as a stabilizing compound,
which may be administered in any sterile, biocompatible pharmaceutical carrier including, but not
limited to, saline, buffered saline, dextrose, and water. The compositions may be administered to a
patient alone, or in combination with other agents, drugs, or hormones.

The pharmaceutical compositions utilized in this invention may be administered by any number 
of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, 
intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, 
sublingual, or rectal means. 

In addition to the active ingredients, these pharmaceutical compositions may contain suitable
pharmaceutically-acceptable carriers comprising excipients and auxiliaries which facilitate processing
of the active compounds into preparations which can be used pharmaceutically. Further details on
techniques for formulation and administration may be found in the latest edition of Remington's
Pharmaceutical Sciences (Mack Publishing, Easton PA).

Pharmaceutical compositions for oral administration can be formulated using pharmaceutically
acceptable carriers well known in the art in dosages suitable for oral administration. Such carriers
enable the pharmaceutical compositions to be formulated as tablets, pills, dragees, capsules, liquids,
gels, syrups, slurries, suspensions, and the like, for ingestion by the patient.

Pharmaceutical preparations for oral use can be obtained through combining active compounds
with solid excipient and processing the resultant mixture of granules (optionally, after grinding) to
obtain tablets or dragee cores. Suitable auxiliaries can be added, if desired. Suitable excipients include
carbohydrate or protein fillers, such as sugars, including lactose, sucrose, mannitol, and sorbitol; starch
from corn, wheat, rice, potato, or other plants; cellulose, such as methyl cellulose,
hydroxypropylmethyl-cellulose, or sodium carboxymethylcellulose; gums, including arabic and
tragacanth; and proteins, such as gelatin and collagen. If desired, disintegrating or solubilizing agents
may be added, such as the cross-linked polyvinyl pyrrolidone, agar, and alginic acid or a salt thereof,
such as sodium alginate.

Dragee cores may be used in conjunction with suitable coatings, such as concentrated sugar
solutions, which may also contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel, polyethylene
glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents or solvent mixtures.
Dyestuffs or pigments may be added to the tablets or dragee coatings for product identification or to
characterize the quantity of active compound, i.e., dosage.

Pharmaceutical preparations which can be used orally include push-fit capsules made of
gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as glycerol or sorbitol.
Push-fit capsules can contain active ingredients mixed with fillers or binders, such as lactose or
starches, lubricants, such as talc or magnesium stearate, and, optionally, stabilizers. In soft capsules,
the active compounds may be dissolved or suspended in suitable liquids, such as fatty oils, liquid, or
liquid polyethylene glycol with or without stabilizers.

Pharmaceutical formulations suitable for parenteral administration may be formulated in
aqueous solutions, preferably in physiologically compatible buffers such as Hanks' solution, Ringer's
solution, or physiologically buffered saline. Aqueous injection suspensions may contain substances
which increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or
dextran. Additionally, suspensions of the active compounds may be prepared as appropriate oily
injection suspensions. Suitable lipophilic solvents or vehicles include fatty oils, such as sesame oil, or
synthetic fatty acid esters, such as ethyl oleate, triglycerides, or liposomes. Non-lipid polycationic
amino polymers may also be used for delivery. Optionally, the suspension may also contain suitable
stabilizers or agents to increase the solubility of the compounds and allow for the preparation of highly
concentrated solutions.

For topical or nasal administration, penetrants appropriate to the particular barrier to be
permeated are used in the formulation. Such penetrants are generally known in the art.

The pharmaceutical compositions of the present invention may be manufactured in a manner
that is known in the art, e.g., by means of conventional mixing, dissolving, granulating, dragee-making,
levigating, emulsifying, encapsulating, entrapping, or lyophilizing processes.

The pharmaceutical composition may be provided as a salt and can be formed with many
acids, including but not limited to, hydrochloric, sulfuric, acetic, lactic, tartaric, malic, and succinic acids.
Salts tend to be more soluble in aqueous or other protonic solvents than are the corresponding free base
forms. In other cases, the preparation may be a lyophilized powder which may contain any or all of the
following: 1 mM to 50 mM histidine, 0.1% to 2% sucrose, and 2% to 7% mannitol, at a pH range of
4.5 to 5.5, that is combined with buffer prior to use.

After pharmaceutical compositions have been prepared, they can be placed in an appropriate
container and labeled for treatment of an indicated condition. For administration of EXMAD, such
labeling would include amount, frequency, and method of administration.

Pharmaceutical compositions suitable for use in the invention include compositions wherein the
active ingredients are contained in an effective amount to achieve the intended purpose. The
determination of an effective dose is well within the capability of those skilled in the art.

For any compound, the therapeutically effective dose can be estimated initially either in cell
culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, or pigs.
An animal model may also be used to determine the appropriate concentration range and route of
administration. Such information can then be used to determine useful doses and routes for
administration in humans.

A therapeutically effective dose refers to that amount of active ingredient, for example
EXMAD or fragments thereof, antibodies of EXMAD, and agonists, antagonists or inhibitors of
EXMAD, which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be
determined by standard pharmaceutical procedures in cell cultures or with experimental animals, such
as by calculating the ED50 (the dose therapeutically effective in 50% of the population) or LD50 (the
dose lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the
therapeutic index, which can be expressed as the LD50/ED50 ratio. Pharmaceutical compositions which
exhibit large therapeutic indices are preferred. The data obtained from cell culture assays and animal
studies are used to formulate a range of dosage for human use. The dosage contained in such
compositions is preferably within a range of circulating concentrations that includes the ED50 with little
or no toxicity. The dosage varies within this range depending upon the dosage form employed, the
sensitivity of the patient, and the route of administration.
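
The therapeutic index described above is a simple ratio, and a minimal sketch of the arithmetic may
help. The function name and the dose values below are hypothetical and chosen only for illustration;
they are not data from this disclosure.

```python
def therapeutic_index(ld50: float, ed50: float) -> float:
    """Return the LD50/ED50 ratio. Both doses must be positive and in the same units."""
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must both be positive")
    return ld50 / ed50


# Hypothetical doses in mg/kg, for illustration only.
example_index = therapeutic_index(ld50=500.0, ed50=5.0)
print(example_index)
```

A larger ratio corresponds to a wider margin between the effective dose and the lethal dose, which is
why compositions with large therapeutic indices are preferred.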

The exact dosage will be determined by the practitioner, in light of factors related to the subject
requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the active
moiety or to maintain the desired effect. Factors which may be taken into account include the severity
of the disease state, the general health of the subject, the age, weight, and gender of the subject, time
and frequency of administration, drug combination(s), reaction sensitivities, and response to therapy.
Long-acting pharmaceutical compositions may be administered every 3 to 4 days, every week, or
biweekly depending on the half-life and clearance rate of the particular formulation.

Normal dosage amounts may vary from about 0.1 µg to 100,000 µg, up to a total dose of
about 1 gram, depending upon the route of administration. Guidance as to particular dosages and
methods of delivery is provided in the literature and generally available to practitioners in the art.
Those skilled in the art will employ different formulations for nucleotides than for proteins or their
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells,
conditions, locations, etc.
DIAGNOSTICS 

In another embodiment, antibodies which specifically bind EXMAD may be used for the
diagnosis of disorders characterized by expression of EXMAD, or in assays to monitor patients being
treated with EXMAD or agonists, antagonists, or inhibitors of EXMAD. Antibodies useful for
diagnostic purposes may be prepared in the same manner as described above for therapeutics.
Diagnostic assays for EXMAD include methods which utilize the antibody and a label to detect
EXMAD in human body fluids or in extracts of cells or tissues. The antibodies may be used with or
without modification, and may be labeled by covalent or non-covalent attachment of a reporter
molecule. A wide variety of reporter molecules, several of which are described above, are known in the
art and may be used.

A variety of protocols for measuring EXMAD, including ELISAs, RIAs, and FACS, are
known in the art and provide a basis for diagnosing altered or abnormal levels of EXMAD expression.
Normal or standard values for EXMAD expression are established by combining body fluids or cell
extracts taken from normal mammalian subjects, for example, human subjects, with antibody to
EXMAD under conditions suitable for complex formation. The amount of standard complex formation
may be quantitated by various methods, such as photometric means. Quantities of EXMAD expressed
in subject, control, and disease samples from biopsied tissues are compared with the standard values.
Deviation between standard and subject values establishes the parameters for diagnosing disease.
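
The comparison of a subject value with standard values can be sketched as follows. This is only an
illustration under stated assumptions: the text does not specify a statistic, so the sketch assumes the
standard is summarized by the mean and sample standard deviation of normal measurements, and the
cutoff of 2.0 standard deviations is a hypothetical choice. All readings are made-up numbers.

```python
from statistics import mean, stdev


def deviation_from_standard(normal_values, subject_value):
    """Deviation of a subject value from the normal mean, in units of the sample SD."""
    return (subject_value - mean(normal_values)) / stdev(normal_values)


def is_altered(normal_values, subject_value, cutoff=2.0):
    """Flag a subject value lying more than `cutoff` SDs from the normal mean.

    The cutoff is an assumption made for this sketch, not a value from the text.
    """
    return abs(deviation_from_standard(normal_values, subject_value)) > cutoff


# Hypothetical photometric readings from normal subjects (arbitrary units).
normal_readings = [10.0, 12.0, 11.0, 9.0, 13.0]
print(is_altered(normal_readings, 20.0))
print(is_altered(normal_readings, 11.5))
```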

In another embodiment of the invention, the polynucleotides encoding EXMAD may be used for
diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences,
complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect and
quantify gene expression in biopsied tissues in which expression of EXMAD may be correlated with
disease. The diagnostic assay may be used to determine absence, presence, and excess expression of
EXMAD, and to monitor regulation of EXMAD levels during therapeutic intervention.

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide
sequences, including genomic sequences, encoding EXMAD or closely related molecules may be used
to identify nucleic acid sequences which encode EXMAD. The specificity of the probe, whether it is
made from a highly specific region, e.g., the 5' regulatory region, or from a less specific region, e.g., a
conserved motif, and the stringency of the hybridization or amplification will determine whether the
probe identifies only naturally occurring sequences encoding EXMAD, allelic variants, or related
sequences.

Probes may also be used for the detection of related sequences, and may have at least 50%
sequence identity to any of the EXMAD encoding sequences. The hybridization probes of the subject
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:26-50 or from
genomic sequences including promoters, enhancers, and introns of the EXMAD gene.
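
The 50% sequence identity criterion can be illustrated with a minimal sketch. It assumes the probe and
target have already been aligned to equal length, with "-" marking gaps, and counts a gap position as a
mismatch; the function name and the sequences are hypothetical and are not taken from the sequence
listing.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned positions at which two pre-aligned sequences carry the same base.

    Assumes equal-length, pre-aligned inputs; a gap ("-") never counts as a match.
    """
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("sequences must be non-empty and of equal aligned length")
    matches = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)


# Hypothetical aligned probe and target, for illustration only.
identity = percent_identity("ACGTACGT", "ACGTTCGA")
print(identity, identity >= 50.0)
```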

Means for producing specific hybridization probes for DNAs encoding EXMAD include the
cloning of polynucleotide sequences encoding EXMAD or EXMAD derivatives into vectors for the
production of mRNA probes. Such vectors are known in the art, are commercially available, and may
be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a variety
of reporter groups, for example, by radionuclides such as 32P or 35S, or by enzymatic labels, such as
alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like.

Polynucleotide sequences encoding EXMAD may be used for the diagnosis of disorders
associated with expression of EXMAD. Examples of such disorders include, but are not limited to, a
cell proliferative disorder, such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis,
hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal
hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and cancers including
adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in
particular, a cancer of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall
bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid,
penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; an immune disorder,
such as inflammation, actinic keratosis, acquired immunodeficiency syndrome (AIDS), Addison's
disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia,
arteriosclerosis, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis,
autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, bursitis,
cholecystitis, cirrhosis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis,
diabetes mellitus, emphysema, erythroblastosis fetalis, erythema nodosum, atrophic gastritis,
glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's thyroiditis,
paroxysmal nocturnal hemoglobinuria, hepatitis, hypereosinophilia, irritable bowel syndrome,
episodic lymphopenia with lymphocytotoxins, mixed connective tissue disease (MCTD), multiple
sclerosis, myasthenia gravis, myocardial or pericardial inflammation, myelofibrosis, osteoarthritis,
osteoporosis, pancreatitis, polycythemia vera, polymyositis, psoriasis, Reiter's syndrome, rheumatoid
arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus,
systemic sclerosis, primary thrombocythemia, thrombocytopenic purpura, ulcerative colitis, uveitis,
Werner syndrome, complications of cancer, hemodialysis, and extracorporeal circulation, viral,
bacterial, fungal, parasitic, protozoal, and helminthic infections, trauma, and hematopoietic cancer
including lymphoma, leukemia, and myeloma; a reproductive disorder, such as a disorder of prolactin
production, infertility, including tubal disease, ovulatory defects, and endometriosis, a disruption of
the estrous cycle, a disruption of the menstrual cycle, polycystic ovary syndrome, ovarian
hyperstimulation syndrome, an endometrial or ovarian tumor, a uterine fibroid, autoimmune
disorders, an ectopic pregnancy, and teratogenesis; cancer of the breast, fibrocystic breast disease, and
galactorrhea; a disruption of spermatogenesis, abnormal sperm physiology, cancer of the testis, cancer
of the prostate, benign prostatic hyperplasia, prostatitis, Peyronie's disease, impotence, carcinoma of
the male breast, and gynecomastia; a neuronal disorder, such as akathesia, Alzheimer's disease,
amnesia, amyotrophic lateral sclerosis, bipolar disorder, catatonia, cerebral neoplasms, dementia,
depression, diabetic neuropathy, Down's syndrome, tardive dyskinesia, dystonias, epilepsy,
Huntington's disease, peripheral neuropathy, multiple sclerosis, neurofibromatosis, Parkinson's
disease, paranoid psychoses, postherpetic neuralgia, schizophrenia, and Tourette's disorder; and a
genetic disorder, such as adrenoleukodystrophy, Alport's syndrome, choroideremia, Duchenne and
Becker muscular dystrophy, Down's syndrome, cystic fibrosis, chronic granulomatous disease,
dentinogenesis imperfecta type II, dentin dysplasia type II, Gaucher's disease, Huntington's chorea,
Marfan's syndrome, muscular dystrophy, myotonic dystrophy, pycnodysostosis, Refsum's syndrome,
retinoblastoma, sickle cell anemia, thalassemia, Werner syndrome, von Willebrand's disease, Wilms'
tumor, Zellweger syndrome, peroxisomal acyl-CoA oxidase deficiency, peroxisomal thiolase
deficiency, peroxisomal bifunctional protein deficiency, mitochondrial carnitine palmitoyl transferase
and carnitine deficiency, mitochondrial very-long-chain acyl-CoA dehydrogenase deficiency,
mitochondrial medium-chain acyl-CoA dehydrogenase deficiency, mitochondrial short-chain acyl-CoA
dehydrogenase deficiency, mitochondrial electron transport flavoprotein and electron transport
flavoprotein:ubiquinone oxidoreductase deficiency, mitochondrial trifunctional protein deficiency,
and mitochondrial short-chain 3-hydroxyacyl-CoA dehydrogenase deficiency. The polynucleotide
sequences encoding EXMAD may be used in Southern or northern analysis, dot blot, or other
membrane-based technologies; in PCR technologies; in dipstick, pin, and multiformat ELISA-like
assays; and in microarrays utilizing fluids or tissues from patients to detect altered EXMAD
expression. Such qualitative or quantitative methods are well known in the art.

In a particular aspect, the nucleotide sequences encoding EXMAD may be useful in assays that
detect the presence of associated disorders, particularly those mentioned above. The nucleotide
sequences encoding EXMAD may be labeled by standard methods and added to a fluid or tissue sample
from a patient under conditions suitable for the formation of hybridization complexes. After a suitable
incubation period, the sample is washed and the signal is quantified and compared with a standard
value. If the amount of signal in the patient sample is significantly altered in comparison to a control
sample then the presence of altered levels of nucleotide sequences encoding EXMAD in the sample
indicates the presence of the associated disorder. Such assays may also be used to evaluate the efficacy
of a particular therapeutic treatment regimen in animal studies, in clinical trials, or to monitor the
treatment of an individual patient.

In order to provide a basis for the diagnosis of a disorder associated with expression of
EXMAD, a normal or standard profile for expression is established. This may be accomplished by
combining body fluids or cell extracts taken from normal subjects, either animal or human, with a
sequence, or a fragment thereof, encoding EXMAD, under conditions suitable for hybridization or
amplification. Standard hybridization may be quantified by comparing the values obtained from normal
subjects with values from an experiment in which a known amount of a substantially purified
polynucleotide is used. Standard values obtained in this manner may be compared with values obtained
from samples from patients who are symptomatic for a disorder. Deviation from standard values is
used to establish the presence of a disorder.

Once the presence of a disorder is established and a treatment protocol is initiated,
hybridization assays may be repeated on a regular basis to determine if the level of expression in the
patient begins to approximate that which is observed in the normal subject. The results obtained from
successive assays may be used to show the efficacy of treatment over a period ranging from several
days to months.
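
The monitoring of successive assays can be sketched as a check on whether each result lies closer to
the normal value than the one before it. This is one simple, assumed reading of "begins to
approximate"; the function names and the expression values are hypothetical.

```python
def distances_from_normal(assay_values, normal_value):
    """Absolute difference between each successive assay result and the normal value."""
    return [abs(value - normal_value) for value in assay_values]


def approaching_normal(assay_values, normal_value):
    """True if no assay result is farther from the normal value than the one before it."""
    distances = distances_from_normal(assay_values, normal_value)
    return all(later <= earlier for earlier, later in zip(distances, distances[1:]))


# Hypothetical expression levels from four successive assays (arbitrary units).
print(approaching_normal([40.0, 28.0, 19.0, 12.0], normal_value=10.0))
print(approaching_normal([40.0, 45.0, 30.0], normal_value=10.0))
```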

With respect to cancer, the presence of an abnormal amount of transcript (either under- or
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the development
of the disease, or may provide a means for detecting the disease prior to the appearance of actual
clinical symptoms. A more definitive diagnosis of this type may allow health professionals to employ
preventative measures or aggressive treatment earlier thereby preventing the development or further
progression of the cancer.

Additional diagnostic uses for oligonucleotides designed from the sequences encoding EXMAD
may involve the use of PCR. These oligomers may be chemically synthesized, generated enzymatically,
or produced in vitro. Oligomers will preferably contain a fragment of a polynucleotide encoding
EXMAD, or a fragment of a polynucleotide complementary to the polynucleotide encoding EXMAD,
and will be employed under optimized conditions for identification of a specific gene or condition.
Oligomers may also be employed under less stringent conditions for detection or quantification of
closely related DNA or RNA sequences.

Methods which may also be used to quantify the expression of EXMAD include radiolabeling
or biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from
standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. et
al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be
accelerated by running the assay in a high-throughput format where the oligomer of interest is presented
in various dilutions and a spectrophotometric or colorimetric response gives rapid quantitation.
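
Interpolating a result from a standard curve can be sketched as piecewise-linear interpolation between
measured standards. The sketch assumes each standard is a (known quantity, measured signal) pair with
distinct signals that increase with quantity; the function name and the numbers are hypothetical, and
the cited references may use a different fitting procedure.

```python
def interpolate_quantity(standards, signal):
    """Estimate a quantity from a measured signal by linear interpolation.

    `standards` is a list of (known_quantity, measured_signal) pairs. Signals outside
    the range covered by the standards are rejected rather than extrapolated.
    """
    points = sorted(standards, key=lambda pair: pair[1])
    for (q0, s0), (q1, s1) in zip(points, points[1:]):
        if s0 <= signal <= s1 and s1 > s0:
            return q0 + (q1 - q0) * (signal - s0) / (s1 - s0)
    raise ValueError("signal lies outside the range of the standard curve")


# Hypothetical standards: (quantity, colorimetric signal), for illustration only.
curve = [(0.0, 0.0), (10.0, 1.0), (100.0, 4.0)]
print(interpolate_quantity(curve, 2.5))
```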

In further embodiments, oligonucleotides or longer fragments derived from any of the
polynucleotide sequences described herein may be used as targets in a microarray. The microarray can
be used to monitor the expression level of large numbers of genes simultaneously and to identify genetic
variants, mutations, and polymorphisms. This information may be used to determine gene function, to
understand the genetic basis of a disorder, to diagnose a disorder, and to develop and monitor the
activities of therapeutic agents.

Microarrays may be prepared, used, and analyzed using methods known in the art. (See, e.g.,
Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci.
USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application WO95/251116; Shalon, D. et al.
(1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. Sci. USA 94:2150-
2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.)

In another embodiment of the invention, nucleic acid sequences encoding EXMAD may be used
to generate hybridization probes useful in mapping the naturally occurring genomic sequence. The
sequences may be mapped to a particular chromosome, to a specific region of a chromosome, or to
artificial chromosome constructions, e.g., human artificial chromosomes (HACs), yeast artificial
chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial P1 constructions, or single
chromosome cDNA libraries. (See, e.g., Harrington, J.J. et al. (1997) Nat. Genet. 15:345-355; Price,
C.M. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends Genet. 7:149-154.)

Fluorescent in situ hybridization (FISH) may be correlated with other physical chromosome
mapping techniques and genetic map data. (See, e.g., Heinz-Ulrich, et al. (1995) in Meyers, supra, pp.
965-968.) Examples of genetic map data can be found in various scientific journals or at the Online
Mendelian Inheritance in Man (OMIM) World Wide Web site. Correlation between the location of the
gene encoding EXMAD on a physical chromosomal map and a specific disorder, or a predisposition to
a specific disorder, may help define the region of DNA associated with that disorder. The nucleotide
sequences of the invention may be used to detect differences in gene sequences among normal, carrier,
and affected individuals.

In situ hybridization of chromosomal preparations and physical mapping techniques, such as
linkage analysis using established chromosomal markers, may be used for extending genetic maps.
Often the placement of a gene on the chromosome of another mammalian species, such as mouse, may
reveal associated markers even if the number or arm of a particular human chromosome is not known.
New sequences can be assigned to chromosomal arms by physical mapping. This provides valuable
information to investigators searching for disease genes using positional cloning or other gene discovery
techniques. Once the disease or syndrome has been crudely localized by genetic linkage to a particular
genomic region, e.g., ataxia-telangiectasia to 11q22-23, any sequences mapping to that area may
represent associated or regulatory genes for further investigation. (See, e.g., Gatti, R.A. et al. (1988)
Nature 336:577-580.) The nucleotide sequence of the subject invention may also be used to detect
differences in the chromosomal location due to translocation, inversion, etc., among normal, carrier, or
affected individuals.

In another embodiment of the invention, EXMAD, its catalytic or immunogenic fragments, or
oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug
screening techniques. The fragment employed in such screening may be free in solution, affixed to a
solid support, borne on a cell surface, or located intracellularly. The formation of binding complexes
between EXMAD and the agent being tested may be measured.

Another technique for drug screening provides for high throughput screening of compounds
having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et al. (1984) PCT
application WO84/03564.) In this method, large numbers of different small test compounds are
synthesized on a solid substrate. The test compounds are reacted with EXMAD, or fragments thereof,
and washed. Bound EXMAD is then detected by methods well known in the art. Purified EXMAD can
also be coated directly onto plates for use in the aforementioned drug screening techniques.
Alternatively, non-neutralizing antibodies can be used to capture the peptide and immobilize it on a
solid support.

In another embodiment, one may use competitive drug screening assays in which neutralizing
antibodies capable of binding EXMAD specifically compete with a test compound for binding
EXMAD. In this manner, antibodies can be used to detect the presence of any peptide which shares one
or more antigenic determinants with EXMAD.

In additional embodiments, the nucleotide sequences which encode EXMAD may be used in 
any molecular biology techniques that have yet to be developed, provided the new techniques rely on 
properties of nucleotide sequences that are currently known, including, but not limited to, such 
properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the preceding 
description, utilize the present invention to its fullest extent. The following preferred specific
embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder 
of the disclosure in any way whatsoever. 

The disclosures of all patents, applications, and publications mentioned above and below, in
particular U.S. Ser. No. 60/133,643 and U.S. Ser. No. 60/150,409, are hereby expressly incorporated
by reference.

EXAMPLES 

I. Construction of cDNA Libraries

RNA was purchased from Clontech or isolated from tissues described in Table 4. Some tissues
were homogenized and lysed in guanidinium isothiocyanate, while others were homogenized and lysed
in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life Technologies), a monophasic
solution of phenol and guanidine isothiocyanate. The resulting lysates were centrifuged over CsCl
cushions or extracted with chloroform. RNA was precipitated from the lysates with either isopropanol
or sodium acetate and ethanol, or by other routine methods.

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A+) RNA was isolated
using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles (QIAGEN,
Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, RNA was
isolated directly from tissue lysates using other RNA isolation kits, e.g., the POLY(A)PURE mRNA
purification kit (Ambion, Austin TX).

In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA
libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP
vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using the
recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 1997, supra, units
5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic
oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with the
appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected (300-1000
bp) using SEPHACRYL S1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column
chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs
were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g.,
PBLUESCRIPT plasmid (Stratagene), PSPORT1 plasmid (Life Technologies), pcDNA2.1 plasmid
(Invitrogen, Carlsbad CA), or pINCY plasmid (Incyte Pharmaceuticals, Palo Alto CA). Recombinant
plasmids were transformed into competent E. coli cells including XL1-Blue, XL1-BlueMRF, or SOLR
from Stratagene or DH5α, DH10B, or ElectroMAX DH10B from Life Technologies.

II. Isolation of cDNA Clones

Plasmids were recovered from host cells by in vivo excision using the UNIZAP vector system
(Stratagene) or by cell lysis. Plasmids were purified using at least one of the following: a Magic or
WIZARD Minipreps DNA purification system (Promega); an AGTC Miniprep purification kit (Edge
Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, QIAWELL 8 Plus Plasmid, QIAWELL 8
Ultra Plasmid purification systems or the R.E.A.L. PREP 96 plasmid purification kit from QIAGEN.
Following precipitation, plasmids were resuspended in 0.1 ml of distilled water and stored, with or
without lyophilization, at 4°C.

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a
high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 384-
well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically using
PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence scanner
(Labsystems Oy, Helsinki, Finland).

III. Sequencing and Analysis

cDNA sequencing reactions were processed using standard methods or high-throughput
instrumentation such as the ABI CATALYST 800 (Perkin-Elmer) thermal cycler or the PTC-200
thermal cycler (MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or
the MICROLAB 2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared
using reagents provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such as
the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Perkin-Elmer).
Electrophoretic separation of cDNA sequencing reactions and detection of labeled polynucleotides were
carried out using the MEGABACE 1000 DNA sequencing system (Molecular Dynamics); the ABI
PRISM 373 or 377 sequencing system (Perkin-Elmer) in conjunction with standard ABI protocols and
base calling software; or other sequence analysis systems known in the art. Reading frames within the
cDNA sequences were identified using standard methods (reviewed in Ausubel, 1997, supra, unit 7.7).
Some of the cDNA sequences were selected for extension using the techniques disclosed in Example VI.
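
The identification of reading frames can be illustrated with a minimal sketch. It is not the method of the cited reference: it assumes a plain DNA string, scans only the three forward frames, and treats an open reading frame as an ATG followed in frame by the first stop codon. The function name longest_orf is hypothetical.

```python
# Illustrative sketch only: forward-strand ORF scan (ATG ... first in-frame stop).
STOP_CODONS = {"TAA", "TAG", "TGA"}

def longest_orf(seq):
    """Return (frame, start, end) of the longest forward ORF, or None.

    start is the index of the A in ATG; end is one past the stop codon.
    """
    seq = seq.upper()
    best = None
    best_len = 0
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i                      # open a candidate ORF
            elif start is not None and codon in STOP_CODONS:
                length = i + 3 - start         # includes the stop codon
                if length > best_len:
                    best_len = length
                    best = (frame, start, i + 3)
                start = None                   # look for the next ATG
    return best
```

A reverse-strand scan could be added by running the same function on the reverse complement of the sequence.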

The polynucleotide sequences derived from cDNA sequencing were assembled and analyzed 
using a combination of software programs which utilize algorithms well known to those skilled in the 
art. Table 5 summarizes the tools, programs, and algorithms used and provides applicable descriptions, 
references, and threshold parameters. The first column of Table 5 shows the tools, programs, and
algorithms used, the second column provides brief descriptions thereof, the third column presents 
appropriate references, all of which are incorporated by reference herein in their entirety, and the fourth 
column presents, where applicable, the scores, probability values, and other parameters used to evaluate 
the strength of a match between two sequences (the higher the score, the greater the homology between 
two sequences). Sequences were analyzed using MACDNASIS PRO software (Hitachi Software 
Engineering, South San Francisco CA) and LASERGENE software (DNASTAR). Polynucleotide and 
polypeptide sequence alignments were generated using the default parameters specified by the clustal 
algorithm as incorporated into the MEGALIGN multisequence alignment program (DNASTAR), which 
also calculates the percent identity between aligned sequences. 
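
As a rough illustration of a percent identity figure, the sketch below computes it for two sequences that have already been aligned. The function name and the convention of counting only columns in which neither sequence has a gap are assumptions made for this example; the exact convention used by the MEGALIGN program is not stated here.

```python
# Illustrative sketch only: percent identity over gap-free alignment columns.
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of compared columns in which the two aligned residues match."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap or b == gap:
            continue                 # assumption: gapped columns are ignored
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared
```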

The polynucleotide sequences were validated by removing vector, linker, and polyA sequences 
and by masking ambiguous bases, using algorithms and programs based on BLAST, dynamic
programming, and dinucleotide nearest neighbor analysis. The sequences were then queried against a
selection of public databases such as the GenBank primate, rodent, mammalian, vertebrate, and
eukaryote databases, and BLOCKS, PRINTS, DOMO, PRODOM, and PFAM to acquire annotation
using programs based on BLAST, FASTA, and BLIMPS. The sequences were assembled into full
length polynucleotide sequences using programs based on Phred, Phrap, and Consed, and were screened
for open reading frames using programs based on GeneMark, BLAST, and FASTA. The full length
polynucleotide sequences were translated to derive the corresponding full length amino acid sequences,
and these full length sequences were subsequently analyzed by querying against databases such as the
GenBank databases (described above), SwissProt, BLOCKS, PRINTS, DOMO, PRODOM, Prosite,
and Hidden Markov Model (HMM)-based protein family databases such as PFAM. HMM is a
probabilistic approach which analyzes consensus primary structures of gene families. (See, e.g.,
Eddy, S.R. (1996) Curr. Opin. Struct. Biol. 6:361-365.)

The programs described above for the assembly and analysis of full length polynucleotide and
amino acid sequences were also used to identify polynucleotide sequence fragments from SEQ ID 
NO:26-50. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and 
amplification technologies were described in The Invention section above. 

IV. Northern Analysis 
Northern analysis is a laboratory technique used to detect the presence of a transcript of a gene
and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs from a
particular cell type or tissue have been bound. (See, e.g., Sambrook, supra, ch. 7; Ausubel, 1995,
supra, ch. 4 and 16.)

Analogous computer techniques applying BLAST were used to search for identical or related
molecules in nucleotide databases such as GenBank or LIFESEQ (Incyte Pharmaceuticals). This
analysis is much faster than multiple membrane-based hybridizations. In addition, the sensitivity of the
computer search can be modified to determine whether any particular match is categorized as exact or
similar. The basis of the search is the product score, which is defined as:

\[ \text{product score} = \frac{\%\ \text{sequence identity} \times \%\ \text{maximum BLAST score}}{100} \]

The product score takes into account both the degree of similarity between two sequences and the length
of the sequence match. For example, with a product score of 40, the match will be exact within a 1% to
2% error, and, with a product score of 70, the match will be exact. Similar molecules are usually
identified by selecting those which show product scores between 15 and 40, although lower scores may
identify related molecules.
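
The product score can be sketched as a short calculation. The function names and the category labels below are illustrative assumptions; the cut-offs of 70, 40, and 15 are taken from the text above, and treating them as hard boundaries is a simplification.

```python
# Illustrative sketch only: product score and a coarse interpretation of it.
def product_score(percent_identity, percent_max_blast_score):
    """(% sequence identity x % maximum BLAST score) / 100."""
    return percent_identity * percent_max_blast_score / 100.0

def classify_match(score):
    """Map a product score onto the ranges described in the text (assumed hard cut-offs)."""
    if score >= 70:
        return "exact"
    if score >= 40:
        return "exact within 1% to 2% error"
    if score >= 15:
        return "similar"
    return "possibly related"
```

For example, a match with 80% sequence identity and 50% of the maximum BLAST score has a product score of 40.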

The results of northern analyses are reported as a percentage distribution of libraries in which
the transcript encoding EXMAD occurred. Analysis involved the categorization of cDNA libraries by
organ/tissue and disease. The organ/tissue categories included cardiovascular, dermatologic,
developmental, endocrine, gastrointestinal, hematopoietic/immune, musculoskeletal, nervous,
reproductive, and urologic. The disease/condition categories included cancer, inflammation, trauma,
cell proliferation, neurological, and pooled. For each category, the number of libraries expressing the
sequence of interest was counted and divided by the total number of libraries across all categories.
Percentage values of tissue-specific and disease- or condition-specific expression are reported in Table 3.
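
The percentage distribution can be sketched as follows. The function name is hypothetical, and the denominator is assumed here to be the total count of expressing libraries summed over all categories; the category counts in the usage note are made-up values for illustration, not data from Table 3.

```python
# Illustrative sketch only: per-category share of libraries expressing a transcript.
from typing import Dict

def percentage_distribution(counts: Dict[str, int]) -> Dict[str, float]:
    """Convert per-category library counts into percentages of the overall total."""
    total = sum(counts.values())
    if total == 0:
        return {category: 0.0 for category in counts}
    return {category: 100.0 * n / total for category, n in counts.items()}
```

With hypothetical counts of 3 nervous and 1 reproductive library, the function returns 75.0 and 25.0 for those categories.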

V. Chromosomal Mapping of EXMAD Encoding Polynucleotides 

The cDNA sequences which were used to assemble SEQ ID NO:40-50 were compared with
sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other
implementations of the Smith-Waterman algorithm. Sequences from these databases that matched
SEQ ID NO:40-50 were assembled into clusters of contiguous and overlapping sequences using
assembly algorithms such as Phrap (Table 5). Radiation hybrid and genetic mapping data available
from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for
Genome Research (WIGR), and Généthon were used to determine if any of the clustered sequences
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment
of all sequences of that cluster, including its particular SEQ ID NO:, to that map location.
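
The assignment rule in the last sentence can be sketched as a small lookup. The data layout, the function name, and the decision to skip clusters whose mapped members disagree are assumptions for illustration; the identifiers in the usage note are hypothetical.

```python
# Illustrative sketch only: propagate a known map location to every cluster member.
from typing import Dict, List

def assign_map_locations(clusters: Dict[str, List[str]],
                         mapped: Dict[str, str]) -> Dict[str, str]:
    """Return sequence id -> map location for clusters containing a mapped sequence."""
    assignments = {}
    for members in clusters.values():
        locations = {mapped[m] for m in members if m in mapped}
        if len(locations) != 1:
            continue  # assumption: unmapped or conflicting clusters are left unassigned
        location = next(iter(locations))
        for member in members:
            assignments[member] = location
    return assignments
```

For instance, a cluster holding two cDNA sequences and one previously mapped marker would yield the marker's location for all three members.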

The genetic map locations of SEQ ID NO:42 and SEQ ID NO:48 are described in The
Invention as ranges, or intervals, of human chromosomes. The map position of an interval, in
centiMorgans, is measured relative to the terminus of the chromosome's p-arm. (The centiMorgan
(cM) is a unit of measurement based on recombination frequencies between chromosomal markers.
On average, 1 cM is roughly equivalent to 1 megabase (Mb) of DNA in humans, although this can
vary widely due to hot and cold spots of recombination.) The cM distances are based on genetic
markers mapped by Généthon which provide boundaries for radiation hybrid markers whose
sequences were included in each of the clusters.

VI. Extension of EXMAD Encoding Polynucleotides

The full length nucleic acid sequences of SEQ ID NO:26-50 were produced by extension of an
appropriate fragment of the full length molecule using oligonucleotide primers designed from this
fragment. One primer was synthesized to initiate 5' extension of the known fragment, and the other
primer, to initiate 3' extension of the known fragment. The initial primers were designed using OLIGO
4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 nucleotides
in length, to have a GC content of about 50% or more, and to anneal to the target sequence at
temperatures of about 68°C to about 72°C. Any stretch of nucleotides which would result in hairpin
structures and primer-primer dimerizations was avoided.
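
Two of the primer criteria above, length and GC content, lend themselves to a simple check. The sketch below is not the OLIGO 4.06 algorithm; the function names are hypothetical, the "about" ranges are treated as hard limits, and annealing temperature, hairpin formation, and primer-primer dimerization are not modeled.

```python
# Illustrative sketch only: length and GC-content screen for candidate primers.
def gc_content(primer):
    """Percent of bases in the primer that are G or C."""
    primer = primer.upper()
    if not primer:
        raise ValueError("primer must not be empty")
    gc = sum(1 for base in primer if base in "GC")
    return 100.0 * gc / len(primer)

def meets_basic_criteria(primer, min_len=22, max_len=30, min_gc=50.0):
    """True if the primer is 22 to 30 nt long and has GC content of 50% or more."""
    return min_len <= len(primer) <= max_len and gc_content(primer) >= min_gc
```

A candidate that passes this screen would still need to be checked for annealing temperature and secondary structure with a dedicated program.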

Selected human cDNA libraries were used to extend the sequence. If more than one extension
was necessary or desired, additional or nested sets of primers were designed.

High fidelity amplification was obtained by PCR using methods well known in the art. PCR
was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction
mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg2+, (NH4)2SO4,
and β-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme
(Life Technologies), and Pfu DNA polymerase (Stratagene), with the following parameters for primer
pair PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 68°C,
2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C. In the
alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1: 94°C, 3 min; Step 2:
94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times;
Step 6: 68°C, 5 min; Step 7: storage at 4°C.
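
The two cycling programs can be written down as data, which makes the programmed hold time easy to total. This is a sketch under stated assumptions: the names are hypothetical, "repeated 20 times" is read as one initial pass plus 20 repeats (21 passes in all), and ramp times between temperatures and the final 4°C hold are ignored.

```python
# Illustrative sketch only: the extension PCR programs as (temperature C, seconds) steps.
def total_hold_seconds(initial, cycle, repeats, final):
    """Sum of programmed hold times, excluding ramping and the 4 C storage step."""
    passes = 1 + repeats  # assumption: first pass plus the stated number of repeats
    return initial[1] + passes * sum(seconds for _, seconds in cycle) + final[1]

PCI_PROGRAM = {
    "initial": (94, 180),
    "cycle": [(94, 15), (60, 60), (68, 120)],
    "repeats": 20,
    "final": (68, 300),
}

# Same program with a 57 C annealing step for primer pair T7 and SK+.
T7_SK_PROGRAM = dict(PCI_PROGRAM, cycle=[(94, 15), (57, 60), (68, 120)])
```

Under these assumptions each program holds for 4575 seconds, a little over 76 minutes, before storage at 4°C.
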

The concentration of DNA in each well was determined by dispensing 100 µl PICOGREEN
quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in 1X TE
and 0.5 µl of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar,
Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan II
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the
concentration of DNA. A 5 µl to 10 µl aliquot of the reaction mixture was analyzed by electrophoresis
on a 1% agarose mini-gel to determine which reactions were successful in extending the sequence.

The extended nucleotides were desalted and concentrated, transferred to 384-well plates,
digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and
sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) agarose
gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended clones were
religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham
Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill in restriction site overhangs,
and transfected into competent E. coli cells. Transformed cells were selected on antibiotic-containing
media, and individual colonies were picked and cultured overnight at 37°C in 384-well plates in LB/2x carb
liquid media.

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase
(Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following parameters:
Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min; Step 5: steps 2,
3, and 4 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was quantified by
PICOGREEN reagent (Molecular Probes) as described above. Samples with low DNA recoveries were
reamplified using the same conditions as described above. Samples were diluted with 20%
dimethylsulfoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer sequencing primers and
the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM BIGDYE
Terminator cycle sequencing ready reaction kit (Perkin-Elmer).

In like manner, the nucleotide sequences of SEQ ID NO:26-50 are used to obtain 5' regulatory
sequences using the procedure above, oligonucleotides designed for such extension, and an appropriate 
genomic library. 

VII. Labeling and Use of Individual Hybridization Probes

Hybridization probes derived from SEQ ID NO:26-50 are employed to screen cDNAs,
genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base
pairs, is specifically described, essentially the same procedure is used with larger nucleotide fragments.
Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 software (National
Biosciences) and labeled by combining 50 pmol of each oligomer, 250 µCi of [γ-32P] adenosine
triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase (DuPont NEN, Boston
MA). The labeled oligonucleotides are substantially purified using a SEPHADEX G-25 superfine size
exclusion dextran bead column (Amersham Pharmacia Biotech). An aliquot containing 10^7 counts per
minute of the labeled probe is used in a typical membrane-based hybridization analysis of human
genomic DNA digested with one of the following endonucleases: Ase I, Bgl II, Eco RI, Pst I, Xba I, or
Pvu II (DuPont NEN).

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon
membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16
hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room temperature
under conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate.
Hybridization patterns are visualized using autoradiography or an alternative imaging means and
compared.

VIII. Microarrays 

A chemical coupling procedure and an ink jet device can be used to synthesize array elements
on the surface of a substrate. (See, e.g., Baldeschweiler, supra.) An array analogous to a dot or slot
blot may also be used to arrange and link elements to the surface of a substrate using thermal, UV,
chemical, or mechanical bonding procedures. A typical array may be produced by hand or using
available methods and machines and contain any appropriate number of elements. After hybridization,
nonhybridized probes are removed and a scanner used to determine the levels and patterns of
fluorescence. The degree of complementarity and the relative abundance of each probe which
hybridizes to an element on the microarray may be assessed through analysis of the scanned images.

Full-length cDNAs, Expressed Sequence Tags (ESTs), or fragments thereof may comprise the
elements of the microarray. Fragments suitable for hybridization can be selected using software well
known in the art such as LASERGENE software (DNASTAR). Full-length cDNAs, ESTs, or
fragments thereof corresponding to one of the nucleotide sequences of the present invention, or selected
at random from a cDNA library relevant to the present invention, are arranged on an appropriate
substrate, e.g., a glass slide. The cDNA is fixed to the slide using, e.g., UV cross-linking followed by
thermal and chemical treatments and subsequent drying. (See, e.g., Schena, M. et al. (1995) Science
270:467-470; Shalon, D. et al. (1996) Genome Res. 6:639-645.) Fluorescent probes are prepared and
used for hybridization to the elements on the substrate. The substrate is analyzed by procedures
described above.

IX. Complementary Polynucleotides 

Sequences complementary to the EXMAD-encoding sequences, or any parts thereof, are used
to detect, decrease, or inhibit expression of naturally occurring EXMAD. Although use of
oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the same procedure
is used with smaller or with larger sequence fragments. Appropriate oligonucleotides are designed
using OLIGO 4.06 software (National Biosciences) and the coding sequence of EXMAD. To inhibit
transcription, a complementary oligonucleotide is designed from the most unique 5' sequence and used
to prevent promoter binding to the coding sequence. To inhibit translation, a complementary
oligonucleotide is designed to prevent ribosomal binding to the EXMAD-encoding transcript.
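
Deriving a complementary oligonucleotide from a chosen stretch of coding sequence can be sketched as a reverse-complement operation. The function names and the start and length parameters are hypothetical; the choice of which region to target is made separately and is not modeled here.

```python
# Illustrative sketch only: complementary (antisense) oligonucleotide for a DNA window.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Reverse complement of a DNA sequence, written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]

def antisense_oligo(coding_seq, start, length):
    """Oligonucleotide complementary to coding_seq[start:start + length]."""
    window = coding_seq[start:start + length]
    if len(window) != length:
        raise ValueError("window extends past the end of the sequence")
    return reverse_complement(window)
```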

X. Expression of EXMAD 

Expression and purification of EXMAD is achieved using bacterial or virus-based expression
systems. For expression of EXMAD in bacteria, cDNA is subcloned into an appropriate vector
containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA
transcription. Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid
promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory
element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3).
Antibiotic resistant bacteria express EXMAD upon induction with isopropyl beta-D-thiogalactopyranoside
(IPTG). Expression of EXMAD in eukaryotic cells is achieved by infecting
insect or mammalian cell lines with recombinant Autographa californica nuclear polyhedrosis virus
(AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is
replaced with cDNA encoding EXMAD by either homologous recombination or bacterial-mediated
transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the strong
polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to
infect Spodoptera frugiperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases.
Infection of the latter requires additional genetic modifications to baculovirus. (See Engelhard, E.K. et
al. (1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther.
7:1937-1945.)

In most expression systems, EXMAD is synthesized as a fusion protein with, e.g., glutathione
S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step,
affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26-kilodalton
enzyme from Schistosoma japonicum, enables the purification of fusion proteins on immobilized
glutathione under conditions that maintain protein activity and antigenicity (Amersham Pharmacia
Biotech). Following purification, the GST moiety can be proteolytically cleaved from EXMAD at
specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity purification
using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak).
6-His, a stretch of six consecutive histidine residues, enables purification on metal-chelate resins
(QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995, supra,
ch. 10 and 16). Purified EXMAD obtained by these methods can be used directly in the following
activity assay.

XI. Demonstration of EXMAD Activity

An assay for EXMAD activity measures the disruption of cytoskeletal filament networks upon
overexpression of EXMAD in cultured cell lines. (Rezniczek, G.A. et al. (1998) J. Cell Biol.
141:209-225.) cDNA encoding EXMAD is subcloned into a mammalian expression vector that drives high
levels of cDNA expression. This construct is transfected into cultured cells, such as rat kangaroo PtK2
or rat bladder carcinoma 804G cells. Actin filaments and intermediate filaments such as keratin and
vimentin are visualized by immunofluorescence microscopy using antibodies and techniques well known
in the art. The configuration and abundance of cytoskeletal filaments can be assessed and quantified
using confocal imaging techniques. In particular, the bundling and collapse of cytoskeletal filament
networks are indicative of EXMAD activity.

Alternatively, an assay for EXMAD activity measures the amount of cell aggregation induced
by overexpression of EXMAD. In this assay, cultured cells such as NIH3T3 are transfected with
cDNA encoding EXMAD contained within a suitable mammalian expression vector under control of a
strong promoter. Cotransfection with cDNA encoding a fluorescent marker protein, such as Green
Fluorescent Protein (Clontech), is useful for identifying stable transfectants. The amount of cell
agglutination, or clumping, associated with transfected cells is compared with that associated with
untransfected cells. The amount of cell agglutination is a direct measure of EXMAD activity.

Alternatively, cell adhesion activity in EXMAD is measured in a 96-well plate assay in which
wells are first coated with EXMAD by adding solutions of EXMAD of varying concentrations to the
wells. Excess EXMAD is washed off with saline, and the wells incubated with a solution of 1% bovine
serum albumin to block non-specific cell binding. Aliquots of a cell suspension of a suitable cell type
are then added to the wells and incubated for a period of time at 37°C. Non-adhered cells are washed
off with saline and the cells stained with a suitable cell stain such as Coomassie blue. The intensity of
staining is measured using a variable wavelength 96-well plate reader and compared to a standard curve
to determine the number of cells adhering to the EXMAD coated plates. The degree of cell staining is
proportional to the cell adhesion activity of EXMAD in the sample.
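
Reading a cell number off a standard curve can be sketched with piecewise linear interpolation. This is one simple way to use such a curve, not a procedure stated in the text; the function name is hypothetical, and the standard points in the usage note are made-up values for illustration only.

```python
# Illustrative sketch only: interpolate cell number from staining intensity.
def cells_from_intensity(intensity, standards):
    """standards: list of (intensity, cell_count) pairs measured on known cell numbers."""
    points = sorted(standards)
    if len(points) < 2:
        raise ValueError("at least two standard points are required")
    if not points[0][0] <= intensity <= points[-1][0]:
        raise ValueError("intensity lies outside the standard curve")
    for (x0, n0), (x1, n1) in zip(points, points[1:]):
        if x0 <= intensity <= x1:
            if x1 == x0:
                return float(n0)
            # Linear interpolation between the two bracketing standards.
            return n0 + (n1 - n0) * (intensity - x0) / (x1 - x0)
```

With hypothetical standards of (0.0, 0), (1.0, 10000), and (2.0, 20000), an intensity of 0.5 corresponds to about 5000 adherent cells. Readings outside the measured range raise an error rather than being extrapolated.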

Alternatively, EXMAD activity is also measured by the interaction of EXMAD with other
molecules. EXMAD, or biologically active fragments thereof, are labeled with 125I Bolton-Hunter
reagent. (See, e.g., Bolton et al. (1973) Biochem. J. 133:529.) Candidate molecules previously arrayed
in the wells of a multi-well plate are incubated with the labeled EXMAD, washed, and any wells with
labeled EXMAD complex are assayed. Data obtained using different concentrations of EXMAD are
used to calculate values for the number, affinity, and association of EXMAD with the candidate
molecules.

XII. Functional Assays

EXMAD function is assessed by expressing the sequences encoding EXMAD at
physiologically elevated levels in mammalian cell culture systems. cDNA is subcloned into a
mammalian expression vector containing a strong promoter that drives high levels of cDNA expression.
Vectors of choice include pCMV SPORT plasmid (Life Technologies) and pCR3.1 plasmid
(Invitrogen), both of which contain the cytomegalovirus promoter. 5-10 µg of recombinant vector are
transiently transfected into a human cell line, for example, an endothelial or hematopoietic cell line,
using either liposome formulations or electroporation. 1-2 µg of an additional plasmid containing
sequences encoding a marker protein are co-transfected. Expression of a marker protein provides a
means to distinguish transfected cells from nontransfected cells and is a reliable predictor of cDNA
expression from the recombinant vector. Marker proteins of choice include, e.g., Green Fluorescent
Protein (GFP; Clontech), CD64, or a CD64-GFP fusion protein. Flow cytometry (FCM), an
automated, laser optics-based technique, is used to identify transfected cells expressing GFP or
CD64-GFP and to evaluate the apoptotic state of the cells and other cellular properties. FCM detects and
quantifies the uptake of fluorescent molecules that diagnose events preceding or coincident with cell
death. These events include changes in nuclear DNA content as measured by staining of DNA with
propidium iodide; changes in cell size and granularity as measured by forward light scatter and 90
degree side light scatter; down-regulation of DNA synthesis as measured by decrease in
bromodeoxyuridine uptake; alterations in expression of cell surface and intracellular proteins as
measured by reactivity with specific antibodies; and alterations in plasma membrane composition as
measured by the binding of fluorescein-conjugated Annexin V protein to the cell surface. Methods in
flow cytometry are discussed in Ormerod, M.G. (1994) Flow Cytometry, Oxford, New York NY.

The influence of EXMAD on gene expression can be assessed using highly purified
populations of cells transfected with sequences encoding EXMAD and either CD64 or CD64-GFP.
CD64 and CD64-GFP are expressed on the surface of transfected cells and bind to conserved regions of
human immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected cells
using magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake Success
NY). mRNA can be purified from the cells using methods well known by those of skill in the art.
Expression of mRNA encoding EXMAD and other genes of interest can be analyzed by northern
analysis or microarray techniques.

XIII. Production of EXMAD Specific Antibodies 

EXMAD substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g.,
Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to
immunize rabbits and to produce antibodies using standard protocols.

Alternatively, the EXMAD amino acid sequence is analyzed using LASERGENE software
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions, are well
described in the art. (See, e.g., Ausubel, 1995, supra, ch. 11.)

Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431A
peptide synthesizer (Perkin-Elmer) using fmoc-chemistry and coupled to KLH (Sigma-Aldrich, St.
Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to increase
immunogenicity. (See, e.g., Ausubel, 1995, supra.) Rabbits are immunized with the oligopeptide-KLH
complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide and anti-EXMAD
activity by, for example, binding the peptide or EXMAD to a substrate, blocking with 1% BSA,
reacting with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG.

XIV. Purification of Naturally Occurring EXMAD Using Specific Antibodies 

Naturally occurring or recombinant EXMAD is substantially purified by immunoaffinity
chromatography using antibodies specific for EXMAD. An immunoaffinity column is constructed by
covalently coupling anti-EXMAD antibody to an activated chromatographic resin, such as
CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is
blocked and washed according to the manufacturer's instructions.

Media containing EXMAD are passed over the immunoaffinity column, and the column is
washed under conditions that allow the preferential absorbance of EXMAD (e.g., high ionic strength
buffers in the presence of detergent). The column is eluted under conditions that disrupt
antibody/EXMAD binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such
as urea or thiocyanate ion), and EXMAD is collected.




XV. Identification of Molecules Which Interact with EXMAD 

EXMAD, or biologically active fragments thereof, are labeled with 125I Bolton-Hunter reagent.
(See, e.g., Bolton A.E. and W.M. Hunter (1973) Biochem. J. 133:529-539.) Candidate molecules
previously arrayed in the wells of a multi-well plate are incubated with the labeled EXMAD, washed,
and any wells with labeled EXMAD complex are assayed. Data obtained using different concentrations
of EXMAD are used to calculate values for the number, affinity, and association of EXMAD with the
candidate molecules.

Alternatively, molecules interacting with EXMAD are analyzed using the yeast two-hybrid
system as described in Fields, S. and O. Song (1989, Nature 340:245-246), or using commercially
available kits based on the two-hybrid system, such as the MATCHMAKER system (Clontech).

Various modifications and variations of the described methods and systems of the invention
will be apparent to those skilled in the art without departing from the scope and spirit of the invention.
Although the invention has been described in connection with certain embodiments, it should be
understood that the invention as claimed should not be unduly limited to such specific embodiments.
Indeed, various modifications of the described modes for carrying out the invention which are obvious
to those skilled in molecular biology or related fields are intended to be within the scope of the following
claims.

1. An isolated polypeptide comprising an amino acid sequence selected from the group
consisting of:

a) an amino acid sequence selected from the group consisting of SEQ ID NO:1-25,

b) a naturally occurring amino acid sequence having at least 90% sequence identity to an
amino acid sequence selected from the group consisting of SEQ ID NO:1-25,

c) a biologically active fragment of an amino acid sequence selected from the group
consisting of SEQ ID NO:1-25, and

d) an immunogenic fragment of an amino acid sequence selected from the group consisting
of SEQ ID NO:1-25.

2. An isolated polypeptide of claim 1 selected from the group consisting of SEQ ID NO:1-25.

3. An isolated polynucleotide encoding a polypeptide of claim 1.

4. An isolated polynucleotide of claim 3 selected from the group consisting of SEQ ID 
NO:26-50. 


5. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 3. 

6. A cell transformed with a recombinant polynucleotide of claim 5. 


7. A transgenic organism comprising a recombinant polynucleotide of claim 5. 

8. A method for producing a polypeptide of claim 1, the method comprising:

a) culturing a cell under conditions suitable for expression of the polypeptide, wherein said
cell is transformed with a recombinant polynucleotide, and said recombinant polynucleotide
comprises a promoter sequence operably linked to a polynucleotide encoding the polypeptide of claim
1, and

b) recovering the polypeptide so expressed.

9. An isolated antibody which specifically binds to a polypeptide of claim 1.

10. An isolated polynucleotide comprising a polynucleotide sequence selected from the
group consisting of:

a) a polynucleotide sequence selected from the group consisting of SEQ ID NO:26-50,

b) a naturally occurring polynucleotide sequence having at least 90% sequence identity to a
polynucleotide sequence selected from the group consisting of SEQ ID NO:26-50,

c) a polynucleotide sequence complementary to a),

d) a polynucleotide sequence complementary to b), and

e) an RNA equivalent of a)-d).






11. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a
polynucleotide of claim 10.



12. A method for detecting a target polynucleotide in a sample, said target polynucleotide
having a sequence of a polynucleotide of claim 10, the method comprising:

a) hybridizing the sample with a probe comprising at least 16 contiguous nucleotides
comprising a sequence complementary to said target polynucleotide in the sample, and which probe
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization
complex is formed between said probe and said target polynucleotide, and

b) detecting the presence or absence of said hybridization complex, and, optionally, if
present, the amount thereof.

13. A method of claim 12, wherein the probe comprises at least 30 contiguous nucleotides.

14. A method of claim 12, wherein the probe comprises at least 60 contiguous nucleotides. 






15. A pharmaceutical composition comprising an effective amount of a polypeptide of claim
1 and a pharmaceutically acceptable excipient.

16. A method for treating a disease or condition associated with decreased expression of
functional EXMAD, comprising administering to a patient in need of such treatment the
pharmaceutical composition of claim 15.

17. A method for screening a compound for effectiveness as an agonist of a polypeptide of
claim 1, the method comprising:

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and

b) detecting agonist activity in the sample.

18. A pharmaceutical composition comprising an agonist compound identified by a method
of claim 17 and a pharmaceutically acceptable excipient.

19. A method for treating a disease or condition associated with decreased expression of
functional EXMAD, comprising administering to a patient in need of such treatment a pharmaceutical
composition of claim 18.

20. A method for screening a compound for effectiveness as an antagonist of a polypeptide
of claim 1, the method comprising:

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and

b) detecting antagonist activity in the sample.

21. A pharmaceutical composition comprising an antagonist compound identified by a
method of claim 20 and a pharmaceutically acceptable excipient.

22. A method for treating a disease or condition associated with overexpression of functional
EXMAD, comprising administering to a patient in need of such treatment a pharmaceutical
composition of claim 21.

23. A method for screening a compound for effectiveness in altering expression of a target
polynucleotide, wherein said target polynucleotide comprises a sequence of claim 4, the method
comprising:

a) exposing a sample comprising the target polynucleotide to a compound, and

b) detecting altered expression of the target polynucleotide.

SEQUENCE LISTING

<110> INCYTE PHARMACEUTICALS, INC.
BANDMAN, Olga
HILLMAN, Jennifer L.
TANG, Y. Tom
LAL, Preeti
YUE, Henry
BAUGHN, Marian R.
LU, Dyung Aina M.
AZIMZAI, Yalda

<120> EXTRACELLULAR MATRIX AND ADHESION-ASSOCIATED PROTEINS
<130> PF-0693 PCT

<140> To Be Assigned

<141> Herewith 

<150> 60/133,643; 60/150,409 
<151> 1999-05-11; 1999-08-23 

<160> 50 

<170> PERL Program

<210> 1 

<211> 309 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 398269CD1

<400> 1

Met Val Phe Pro Ala Lys A Phe Cys Leu Val Pro Ser Met Glu
1 5 10 15

Gly Val Arg Trp Ala Phe S Cys Gly Thr Trp Leu Pro Ser Arg
20 25 30

Ala Glu Trp Leu Leu Ala V Arg Ser Ile Gln Pro Glu Glu Lys
35 40 45

Glu Arg Ile Gly Gln Phe V Phe Ala Arg Asp Ala Lys Ala Ala
50 55 60

Met Ala Gly Arg Leu Met I Arg Lys Leu Val Ala Glu Lys Leu
65 70 75

Asn Ile Pro Trp Asn His I Arg Leu Gln Arg Thr Ala Lys Gly
80 85 90

Lys Pro Val Leu Ala Lys A Ser Ser Asn Pro Tyr Pro Asn Phe
95 100 105

Asn Phe Asn Ile Ser His G Gly Asp Tyr Ala Val Leu Ala Ala
110 115 120

Glu Pro Glu Leu Gln Val G Ile Asp Ile Met Lys Thr Ser Phe
125 130 135

Pro Gly Arg Gly Ser Ile F Glu Phe Phe His Ile Met Lys Arg
140 145 150

Lys Phe Thr Asn Lys Glu I Glu Thr Ile Arg Ser Phe Lys Asp
155 160 165

Glu Trp Thr Gln Leu Asp X. Phe Tyr Arg Asn Trp Ala Leu Lys
170 175 180

Glu Ser Phe Ile Lys Ala 1 Gly Val Gly Leu Gly Phe Glu Leu
185 190 195

Gln Arg Leu Glu Phe Asp I Ser Pro Leu Asn Leu Asp Ile Gly
200 205 210

Gln Val Tyr Lys Glu Thr ; Leu Phe Leu Asp Gly Glu Glu Glu
215 220 225

Ser Lys Ile Asp Glu His His Phe
230 235 240

Val Ala Val Ala Leu Arg Lys Pro Asp Gly Ser Arg His Gln Asp
245 250 255

Val Pro Ser Gln Asp Asp Ser Lys Pro Thr Gln Arg Gln Phe Thr
260 265 270

Ile Leu Asn Phe Asn Asp Leu Met Ser Ser Ala Val Pro Met Thr
275 280 285

Pro Glu Asp Pro Ser Phe Trp Asp Cys Phe Cys Phe Thr Glu Glu
290 295 300

Ile Pro Ile Arg Asn Gly Thr Lys Ser
305

<210> 2 

<211> 554 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 1258888CD1 

<400> 2 

Met Pro Leu Pro Trp Ser Leu Ala Leu Pro Leu Leu Leu Ser Trp
1 5 10 15

Val Ala Gly Gly Phe Gly Asn Ala Ala Ser Ala Arg His His Gly
20 25 30

Leu Leu Ala Ser Ala Arg Gln Pro Gly Val Cys His Tyr Gly Thr
35 40 45

Lys Leu Ala Cys Cys Tyr Gly Trp Arg Arg Asn Ser Lys Gly Val
50 55 60

Cys Glu Ala Thr Cys Glu Pro Gly Cys Lys Phe Gly Glu Cys Val
65 70 75

Gly Pro Asn Lys Cys Arg Cys Phe Pro Gly Tyr Thr Gly Lys Thr
80 85 90

Cys Ser Gln Asp Val Asn Glu Cys Gly Met Lys Pro Arg Pro Cys
95 100 105

Gln His Arg Cys Val Asn Thr His Gly Ser Tyr Lys Cys Phe Cys
110 115 120

Leu Ser Gly His Met Leu Met Pro Asp Ala Thr Cys Val Asn Ser
125 130 135

Arg Thr Cys Ala Met Ile Asn Cys Gln Tyr Ser Cys Glu Asp Thr
140 145 150

Glu Glu Gly Pro Gln Cys Leu Cys Pro Ser Ser Gly Leu Arg Leu
155 160 165

Ala Pro Asn Gly Arg Asp Cys Leu Asp Ile Asp Glu Cys Ala Ser
170 175 180

Gly Lys Val Ile Cys Pro Tyr Asn Arg Arg Cys Val Asn Thr Phe
185 190 195

Gly Ser Tyr Tyr Cys Lys Cys His Ile Gly Phe Glu Leu Gln Tyr
200 205 210

Ile Ser Gly Arg Tyr Asp Cys Ile Asp Ile Asn Glu Cys Thr Met
215 220 225

Asp Ser His Thr Cys Ser His His Ala Asn Cys Phe Asn Thr Gln
230 235 240

Gly Ser Phe Lys Cys Lys Cys Lys Gln Gly Tyr Lys Gly Asn Gly
245 250 255

Leu Arg Cys Ser Ala Ile Pro Glu Asn Ser Val Lys Glu Val Leu
260 265 270

Arg Ala Pro Gly Thr Ile Lys Asp Arg Ile Lys Lys Leu Leu Ala
275 280 285

His Lys Asn Ser Met Lys Lys Lys Ala Lys Ile Lys Asn Val Thr
290 295 300

Pro Glu Pro Thr Arg Thr Pro Thr Pro Lys Val Asn Leu Gln Pro
305 310 315

Phe Asn Tyr Glu Glu Ile Val Ser Arg Gly Gly Asn Ser His Gly
320 325 330

Gly Lys Lys Gly Asn Glu Glu Lys Met Lys Glu Gly Leu Glu Asp
335 340 345

Glu Lys Arg Glu Glu Lys Ala Leu Lys Asn Asp Ile Glu Glu Arg
350 355 360

Ser Leu Arg Gly Asp Val Phe Phe Pro Lys Val Asn Glu Ala Gly
365 370 375

Glu Phe Gly Leu Ile Leu Val Gln Arg Lys Ala Leu Thr Ser ^ys
380 385 390

Leu Glu His Lys Ala Asp Leu Asn Ile Ser Val Asp Cys Ser Phe
395 400 405

Asn His Gly Ile Cys Asp Trp Lys Gln Asp Arg Glu Asp Asp Phe
410 415 420

Asp Trp Asn Pro Ala Asp Arg Asp Asn Ala Ile Gly Phe Tyr Met
425 430 435

Ala Val Pro Ala Leu Ala Gly His Lys Lys Asp Ile Gly Arg Leu
440 445 450

Lys Leu Leu Leu Pro Asp Leu Gln Pro Gln Ser Asn Phe Cys Leu
455 460 465

Leu Phe Asp Tyr Arg Leu Ala Gly Asp Lys Val Gly Lys Leu Arg
470 475 480

Val Phe Val Lys Asn Ser Asn Asn Ala Leu Ala Trp Glu Lys Thr
485 490 495

Thr Ser Glu Asp Glu Lys Trp Lys Thr Gly Lys Ile Gln Leu Tyr
500 505 510

Gln Gly Thr Asp Ala Thr Lys Ser Ile Ile Phe Glu Ala Glu Arg
515 520 525

Gly Lys Gly Lys Thr Gly Glu Ile Ala Val Asp Gly Val Leu Leu
530 535 540

Val Ser Gly Leu Cys Pro Asp Ser Leu Leu Ser Val Asp Asp
545 550

<210> 3 
<211> 482 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 1375891CD1

<400> 3

Met Gly Cys Leu Trp Gly Leu Ala Leu Pro Leu Phe Phe Phe Cys
1 5 10 15

Trp Glu Val Gly Val Ser Gly Ser Ser Ala Gly Pro Ser Thr Arg
20 25 30

Arg Ala Asp Thr Ala Met Thr Thr Asp Asp Thr Glu Val Pro Ala
35 40 45

Met Thr Leu Ala Pro Gly His Ala Ala Leu Glu Thr Gln Thr Leu
50 55 60

Ser Ala Glu Thr Ser Ser Arg Ala Ser Thr Pro Ala Gly Pro Ile
65 70 75

Pro Glu Ala Glu Thr Arg Gly Ala Lys Arg Ile Ser Pro Ala Arg
80 85 90

Glu Thr Arg Ser Phe Thr Lys Thr Ser Pro Asn Phe Met Val Leu
95 100 105

Ile Ala Thr Ser Val Glu Thr Ser Ala Ala Ser Gly Ser Pro Glu
110 115 120

Gly Ala Gly Met Thr Thr Val Gln Thr Ile Thr Gly Ser Asp Pro
125 130 135

Glu Glu Ala Ile Phe Asp Thr Leu Cys Thr Asp Asp Ser Ser Glu
140 145 150

Glu Ala Lys Thr Leu Thr Met Asp Ile Leu Thr Leu Ala His Thr
155 160 165

Ser Thr Glu Ala Lys Gly Leu Ser Ser Glu Ser Ser Ala Ser Ser
170 175 180

Asp Gly Pro His Pro Val Ile Thr Pro Ser Arg Ala Ser Glu Ser
185 190 195

Ser Ala Ser Ser Asp Gly Pro His Pro Val Ile Thr Pro Ser Arg
200 205 210

Ala Ser Glu Ser Ser Ala Ser Ser Asp Gly Pro His Pro Val Ile
215 220 225

Thr Pro Ser Trp Ser Pro Gly Ser Asp Val Thr Leu Leu Ala Glu
230 235 240

Ala Leu Val Thr Val Thr Asn Ile Glu Val Ile Asn Cys Ser Ile
245 250 255

Thr Glu Ile Glu Thr Thr Thr Ser Ser Ile Pro Gly Ala Ser Asp
260 265 270

Ile Asp Leu Ile Pro Thr Glu Gly Val Lys Ala Ser Ser Thr Ser
275 280 285

Asp Pro Pro Ala Leu Pro Asp Ser Thr Glu Ala Lys Pro His Ile
290 295 300

Thr Glu Val Thr Ala Ser Ala Glu Thr Leu Ser Thr Ala Gly Thr
305 310 315

Thr Glu Ser Ala Ala Pro His Ala Thr Val Gly Thr Pro Leu Pro
320 325 330

Thr Asn Ser Ala Thr Glu Arg Glu Val Thr Ala Pro Gly Ala Thr
335 340 345

Thr Leu Ser Gly Ala Leu Val Thr Val Ser Arg Asn Pro Leu Glu
350 355 360

Glu Thr Ser Ala Leu Ser Val Glu Thr Pro Ser Tyr Val Lys Val
365 370 375

Ser Gly Ala Ala Pro Val Ser Ile Glu Ala Gly Ser Ala Val Gly
380 385 390

Lys Thr Thr Ser Phe Ala Gly Ser Ser Ala Ser Ser Tyr Ser Pro
395 400 405

Ser Glu Ala Ala Leu Lys Asn Phe Thr Pro Ser Glu Thr Pro Thr
410 415 420

Met Asp Ile Ala Thr Lys Gly Pro Phe Pro Thr Ser Arg Asp Pro
425 430 435

Leu Pro Ser Val Pro Pro Thr Thr Thr Asn Ser Ser Arg Gly Thr
440 445 450

Asn Ser Thr Leu Ala Lys Ile Thr Thr Ser Ala Lys Thr Thr Met
455 460 465

Lys Pro Gln Gln Pro Arg Pro Arg Leu Pro Gly Arg Gly Arg Pro
470 475 480

Gln Thr

<210> 4 

<211> 735 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 1524355CD1

<400> 4 

Met Ala Ala Gly Gly Ala \ Ala Ala Ala Pro Glu Cys Arg Leu
1 5 10 15

Leu Pro Tyr Ala Leu His L Trp Ser Ser Phe Ser Ser Thr Tyr
20 25 30

Leu Pro Glu Asn Ile Leu V Asp Lys Pro Asn Asp Gln Ser Ser
35 40 45

Arg Trp Ser Ser Glu Ser A Tyr Pro Pro Gln Tyr Leu Ile Leu
50 55 60

Lys Leu Glu Arg Pro Ala I Val Gln Asn Ile Thr Phe Gly Lys
65 70 75

Tyr Glu Lys Thr His Val C Asn Leu Lys Lys Phe Lys Val Phe
80 85 90

Gly Gly Met Asn Glu Glu A Met Thr Glu Leu Leu Ser Ser Gly
95 100 105

Leu Lys Asn Asp Tyr Asn L' Glu Thr Phe Thr Leu Lys His Lys
110 115 120

Ile Asp Glu Gln Met Phe P: Cys Arg Phe Ile Lys Ile Val Pro
125 130 135

Leu Leu Ser Trp Gly Pro S< Phe Asn Phe Ser Ile Trp Tyr Val
140 145 150

Glu Leu Ser Gly Ile Asp Asp Pro Asp Ile Val Gln Pro Cys Leu
155 160 165

Asn Trp Tyr Ser Lys Tyr Arg Glu Gln Glu Ala Ile Arg Leu Cys
170 175 180

Leu Lys His Phe Arg Gln His Asn Tyr Thr Glu Ala Phe Glu Ser
185 190 195

Leu Gln Lys Lys Thr Lys Ile Ala Leu Glu His Pro Met Leu Thr
200 205 210

Asp Ile His Asp Lys Leu Val Leu Lys Gly Asp Phe Asp Ala Cys
215 220 225

Glu Glu Leu Ile Glu Lys Ala Val Asn Asp Gly Leu Phe Asn Gln
230 235 240

Tyr Ile Ser Gln Gln Glu Tyr Lys Pro Arg Trp Ser Gln Ile Ile
245 250 255

Pro Lys Ser Thr Lys Gly Asp Gly Glu Asp Asn Arg Pro Gly Met
260 265 270

Arg Gly Gly His Gln Met Val Ile Asp Val Gln Thr Glu Thr Val
275 280 285

Tyr Leu Phe Gly Gly Trp Asp Gly Thr Gln Asp Leu Ala Asp Phe
290 295 300

Trp Ala Tyr Ser Val Lys Glu Asn Gln Trp Thr Cys Ile Ser Arg
305 310 315

Asp Thr Glu Lys Glu Asn Gly Pro Ser Ala Arg Ser Cys His Lys
320 325 330

Met Cys Ile Asp Ile Gln Arg Arg Gln Ile Tyr Thr Leu Gly Arg
335 340 345

Tyr Leu Asp Ser Ser Val Arg Asn Ser Lys Ser Leu Lys Ser Asp
350 355 360

Phe Tyr Arg Tyr Asp Ile Asp Thr Asn Thr Trp Met Leu Leu Ser
365 370 375

Glu Asp Thr Ala Ala Asp Gly Gly Pro Lys Leu Val Phe Asp His
380 385 390

Gln Met Cys Met Asp Ser Glu Lys His Met Ile Tyr Thr Phe Gly
395 400 405

Gly Arg Ile Leu Thr Cys Asn Gly Ser Val Asp Asp Ser Arg Ala
410 415 420

Ser Glu Pro Gln Phe Ser Gly Leu Phe Ala Phe Asn Cys Gln Cys
425 430 435

Gln Thr Trp Lys Leu Leu Arg Glu Asp Ser Cys Asn Ala Gly Pro
440 445 450

Glu Asp Ile Gln Ser Arg Ile Gly His Cys Met Leu Phe His Ser
455 460 465

Lys Asn Arg Cys Leu Tyr Val Phe Gly Gly Gln Arg Ser Lys Thr
470 475 480

Tyr Leu Asn Asp Phe Phe Ser Tyr Asp Val Asp Ser Asp His Val
485 490 495

Asp Ile Ile Ser Asp Gly Thr Lys Lys Asp Ser Gly Met Val Pro
500 505 510

Met Thr Gly Phe Thr Gln Arg Ala Thr Ile Asp Pro Glu Leu Asn
515 520 525

Glu Ile His Val Leu Ser Gly Leu Ser Lys Asp Lys Glu Lys Arg
530 535 540

Glu Glu Asn Val Arg Asn Ser Phe Trp Ile Tyr Asp Ile Val Arg
545 550 555

Asn Ser Trp Ser Cys Val Tyr Lys Asn Asp Gln Ala Ala Lys Asp
560 565 570

Asn Pro Thr Lys Ser Leu Gln Glu Glu Glu Pro Cys Pro Arg Phe
575 580 585

Ala His Gln Leu Val Tyr Asp Glu Leu His Lys Val His Tyr Leu
590 595 600

Phe Gly Gly Asn Pro Gly Lys Ser Cys Ser Pro Lys Met Arg Leu
605 610 615

Asp Asp Phe Trp Ser Leu Lys Leu Cys Arg Pro Ser Lys Asp Tyr
620 625 630

Leu Leu Arg His Cys Lys Tyr Leu Ile Arg Lys His Arg Phe Glu
635 640 645

Glu Lys Ala Gln Val Asp Pro Leu Ser Ala Leu Lys Tyr Leu Gln




650 655 660

Asn Asp Leu Tyr Ile Thr Val Asp His Ser Asp Pro Glu Glu Thr
665 670 675

Lys Glu Phe Gln Leu Leu Ala Ser Ala Leu Phe Lys Ser Gly Ser
680 685 690

Asp Phe Thr Ala Leu Gly Phe Ser Asp Val Asp His Thr Tyr Ala
695 700 705

Gln Arg Thr Gln Leu Phe Asp Thr Leu Val Asn Phe Phe Pro Asp
710 715 720

Ser Met Thr Pro Pro Lys Gly Asn Leu Val Asp Leu Ile Thr Leu
725 730 735

<210> 5
<211> 424
<212> PRT

<213> Homo sapiens
<220>

<221> misc_feature
<223> Incyte ID No: 1598937CD1

<400> 5

Met Ala Pro Glu Glu Asp Ala Gly Gly Glu Ala Leu Gly Gly Ser
1 5 10 15

Phe Trp Glu Ala Gly Asn Tyr Arg Arg Thr Val Gln Arg Val Glu
20 25 30

Asp Gly His Arg Leu Cys Gly Asp Leu Val Ser Cys Phe Gln Glu
35 40 45

Arg Ala Arg Ile Glu Lys Ala Tyr Ala Gln Gln Leu Ala Asp T^
50 55 60

Ala Arg Lys Trp Arg Gly Thr Val Glu Lys Gly Pro Gln Tyr Gly
65 70 75

Thr Leu Glu Lys Ala Trp His Ala Phe Phe Thr Ala Ala Glu Arg
80 85 90

Leu Ser Ala Leu His Leu Glu Val Arg Glu Lys Leu Gln Gly Gln
95 100 105

Asp Ser Glu Arg Val Arg Ala Trp Gln Arg Gly Ala Phe His Arg
110 115 120

Pro Val Leu Gly Gly Phe Arg Glu Ser Arg Ala Ala Glu Asp Gly
125 130 135

Phe Arg Lys Ala Gln Lys Pro Trp Leu Lys Arg Leu Lys Glu Val
140 145 150

Glu Ala Ser Lys Lys Ser Tyr His Ala Ala Arg Lys Asp Glu Lys
155 160 165

Thr Ala Gln Thr Arg Glu Ser His Ala Lys Ala Asp Ser Ala Val
170 175 180

Ser Gln Glu Gln Leu Arg Lys Leu Gln Glu Arg Val Glu Arg Cys
185 190 195

Ala Lys Glu Ala Glu Lys Thr Lys Ala Gln Tyr Glu Gln Thr Leu
200 205 210

Ala Glu Leu His Arg Tyr Thr Pro Arg Tyr Met Glu Asp Met Glu
215 220 225

Gln Ala Phe Glu Thr Cys Gln Ala Ala Glu Arg Gln Arg Leu Leu
230 235 240

Phe Phe Lys Asp Met Leu Leu Thr Leu His Gln His Leu Asp Leu
245 250 255

Ser Ser Ser Glu Lys Phe His Glu Leu His Arg Asp Leu His Gln
260 265 270

Gly Ile Glu Ala Ala Ser Asp Glu Glu Asp Leu Arg Trp Trp Arg
275 280 285

Ser Thr His Gly Pro Gly Met Ala Met Asn Trp Pro Gln Phe Glu
290 295 300

Glu Trp Ser Leu Asp Thr Gln Arg Thr Ile Ser Arg Lys Glu Lys
305 310 315

Gly Gly Arg Ser Pro Asp Glu Val Thr Leu Thr Ser Ile Val Pro
320 325 330

Thr Arg Asp Gly Thr Ala Pro Pro Pro Gln Ser Pro Gly Ser Pro
335 340 345

Gly Thr Gly Gln Asp Glu Glu Trp Ser Asp Glu Glu Ser Pro Arg
350 355 360

Lys Ala Ala Thr Gly Val Arg Val Arg Ala Leu Tyr Asp Tyr Ala
365 370 375

Gly Gln Glu Ala Asp Glu Leu Ser Phe Arg Ala Gly Glu Glu Leu
380 385 390

Leu Lys Met Ser Glu Glu Asp Glu Gln Gly Trp Cys Gln Gly Gln
395 400 405

Leu Gln Ser Gly Arg Ile Gly Leu Tyr Pro Ala Asn Tyr Val Glu
410 415 420

Cys Val Gly Ala

<210> 6 

<211> 420 

<212> PRT 

<213> Homo sapiens 

<220>

<221> misc_feature

<223> Incyte ID No: 1725801CD1

<400> 6

Met Ala Pro Trp Pro Pro Lys Gly Leu Val Pro Ala Val Leu Trp
1 5 10 15

Gly Leu Ser Leu Phe Leu Asn Leu Pro Gly Pro Ile Trp Leu Gln
20 25 30

Pro Ser Pro Pro Pro Gln Ser Ser Pro Pro Pro Gln Pro His Pro
35 40 45

Cys His Thr Cys Arg Gly Leu Val Asp Ser Phe Asn Lys Gly Leu
50 55 60

Glu Arg Thr Ile Arg Asp Asn Phe Gly Gly Gly Asn Thr Ala Trp
65 70 75

Glu Glu Glu Asn Leu Ser Lys Tyr Lys Asp Ser Glu Thr Arg Leu
80 85 90

Val Glu Val Leu Glu Gly Val Cys Ser Lys Ser Asp Phe Glu Cys
95 100 105

His Arg Leu Leu Glu Leu Ser Glu Glu Leu Val Glu Ser Trp Trp
110 115 120

Phe His Lys Gln Gln Glu Ala Pro Asp Leu Phe Gln Trp Leu Cys
125 130 135

Ser Asp Ser Leu Lys Leu Cys Cys Pro Ala Gly Thr Phe Gly Pro
140 145 150

Ser Cys Leu Pro Jys Pro Gly Gly Thr Glu Arg Pro Cys Gly Gly
155 160 165

Tyr Gly Gln Cys Glu Gly Glu Gly Thr Arg Gly Gly Ser Gly His
170 175 180

Cys Asp Cys Gln Ala Gly Tyr Gly Gly Glu Ala Cys Gly Gln Cys
185 190 195

Gly Leu Gly Tyr Phe Glu Ala Glu Arg Asn Ala Ser His Leu Val
200 205 210

Cys Ser Ala Cys Phe Gly Pro Cys Ala Arg Cys Ser Gly Pro Glu
215 220 225

Glu Ser Asn Cys Leu Gln Cys Lys Lys Gly Trp Ala Leu His His
230 235 240

Leu Lys Cys Val Asp Ile Asp Glu Cys Gly Thr Glu Gly Ala Asn
245 250 255

Cys Gly Ala Asp Gln Phe Cys Val Asn Thr Glu Gly Ser Tyr Glu
260 265 270

Cys Arg Asp Cys Ala Lys Ala Cys Leu Gly Cys Met Gly Ala Gly
275 280 285

Pro Gly Arg Cys Lys Lys Cys Ser Pro Gly Tyr Gln Gln Val Gly
290 295 300

Ser Lys Cys Leu Asp Val Asp Glu Cys Glu Thr Glu Val Cys Pro
305 310 315

Gly Glu Asn Lys Gln Cys Glu Asn Thr Glu Gly Gly Tyr Arg Cys
320 325 330

Ile Cys Ala Glu Gly Tyr Lys Gln Met Glu Gly Ile Cys Val Lys
335 340 345

Glu Gln Ile Pro Glu Ser Ala Gly Phe Phe Ser Glu Met Thr Glu
350 355 360

Asp Glu Leu Val Val Leu Gln Gln Met Phe Phe Gly Ile Ile Ile
365 370 375

Cys Ala Leu Ala Thr Leu Ala Ala Lys Gly Asp Leu Val Phe Thr
380 385 390

Ala Ile Phe Ile Gly Ala Val Ala Ala Met Thr Gly Tyr Trp Leu
395 400 405

Ser Glu Arg Ser Asp Arg Val Leu Glu Gly Phe Ile Lys Gly Arg
410 415 420

<210> 7 

<211> 795 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 1730482CD1 

<400> 7 

Met Glu Lys Thr Gln Ser Leu Pro Thr Arg Pro Pro Thr Phe Pro
1 5 10 15

Pro Thr Ile Pro Pro Ala Lys Glu Val Cys Lys Ala Ala Lys Ala
20 25 30

Asp Leu Val Phe Met Val Asp Gly Ser Trp Ser Ile Gly Asp Glu
35 40 45

Asn Phe Asn Lys Ile Ile Ser Phe Leu Tyr Ser Thr Val Gly Ala
50 55 60

Leu Asn Lys Ile Gly Thr Asp Gly Thr Gln Val Ala Met Val Gln
65 70 75

Phe Thr Asp Asp Pro Arg Thr Glu Phe Lys Leu Asn Ala Tyr iZt
80 85 90

Thr Lys Glu Thr Leu Leu Asp Ala Ile Lys His Ile Ser Tyr Lys
95 100 105

Gly Gly Asn Thr Lys Thr Gly Lys Ala Ile Lys Tyr Val Arg Asp
110 115 120

Thr Leu Phe Thr Ala Glu Ser Gly Thr Arg Arg Gly Ile Pro JyS
125 130 135

Val Ile Val Val Ile Thr Asp Gly Arg Ser Gln Asp Asp Val Asn
140 145 150

Lys Ile Ser Arg Glu Met Gln Leu Asp Gly Tyr Ser Ile Phe Ala
155 160 165

Ile Gly Val Ala Asp Ala Asp Tyr Ser Glu Leu Val Ser Ile Gly
170 175 180

Ser Lys Pro Ser Ala Arg His Val Phe Phe Val Asp Asp Phe Asp
185 190 195

Ala Phe Lys Lys Ile Glu Asp Glu Leu Ile Thr Phe Val Cys Glu
200 205 210

Thr Ala Ser Ala Thr Cys Pro Val Val His Lys Asp Gly Ile Asp
215 220 225

Leu Ala Gly Phe Lys Met Met Glu Met Phe Gly Leu Val Glu Lys
230 235 240

Asp Phe Ser Ser Val Glu Gly Val Ser Met Glu Pro Gly Thr Phe
245 250 255

Asn Val Phe Pro Cys Tyr Gln Leu His Lys Asp Ala Leu Val Ser
260 265 270

Gln Pro Thr Arg Tyr Leu His Pro Glu Gly Leu Pro Ser Asp Tyr
275 280 285

Thr Ile Ser Phe Leu Phe Arg Ile Leu Pro Asp Thr Pro Gln Glu
290 295 300

Pro Phe Ala Leu Trp Glu Ile Leu Asn Lys Asn Ser Asp Pro Leu
305 310 315

Val Gly Val Ile Leu Asp Asn Gly Gly Lys Thr Leu Thr Tyr Phe
320 325 330

Asn Tyr Asp Gln Ser Gly Asp Phe Gln Thr Val Thr Phe Glu Gly
335 340 345

Pro Glu Ile Arg Lys Ile Phe Tyr Gly Ser Phe His Lys Leu His
350 355 360

Ile Val Val Ser Glu Thr Leu Val Lys Val Val Ile Asp Cys Lys
365 370 375

Gln Val Gly Glu Lys Ala Met Asn Ala Ser Ala Asn Ile Thr Ser
380 385 390

Asp Gly Val Glu Val Leu Gly Lys Met Val Arg Ser Arg Gly Pro
395 400 405

Gly Gly Asn Ser Ala Pro Phe Gln Leu Gln Met Phe Asp Ile Val
410 415 420

Cys Ser Thr Ser Trp Ala Asn Thr Asp Lys Cys Cys Glu Leu Pro
425 430 435

Gly Leu Arg Asp Asp Glu Ser Cys Pro Asp Leu Pro His Ser Cys
440 445 450

Ser Cys Ser Glu Thr Asn Glu Val Ala Leu Gly Pro Ala Gly Pro
455 460 465

Pro Gly Gly Pro Gly Leu Arg Gly Pro Lys Gly Gln Gln Gly Glu
470 475 480

Pro Gly Pro Lys Gly Pro Asp Gly Pro Arg Gly Glu Ile Gly Leu
485 490 495

Pro Gly Pro Gln Gly Pro Pro Gly Pro Gln Gly Pro Ser Gly Leu
500 505 510

Ser Ile Gln Gly Met Pro Gly Met Pro Gly Glu Lys Gly Glu Lys
515 520 525

Gly Asp Thr Gly Leu Pro Gly Pro Gln Gly Ile Pro Gly Gly Val
530 535 540

Gly Ser Pro Gly Arg Asp Gly Ser Pro Gly Gln Arg Gly Leu Pro
545 550 555

Gly Lys Asp Gly Ser Ser Gly Pro Pro Gly Pro Pro Gly Pro Ile
560 565 570

Gly Ile Pro Gly Thr Pro Gly Val Pro Gly Ile Thr Gly Ser Met
575 580 585

Gly Pro Gln Gly Ala Leu Gly Pro Pro Gly Val Pro Gly Ala Lys
590 595 600

Gly Glu Arg Gly Glu Arg Gly Asp Leu Gln Ser Gln Ala Met Val
605 610 615

Arg Ser Val Ala Arg Gln Val Cys Glu Gln Leu Ile Gln Ser His
620 625 630

Met Ala Arg Tyr Thr Ala Ile Leu Asn Gln Ile Pro Ser His Ser
635 640 645

Ser Ser Ile Arg Thr Val Gln Gly Pro Pro Gly Glu Pro Gly Arg
650 655 660

Pro Gly Ser Pro Gly Ala Pro Gly Glu Gln Gly Pro Pro Gly Thr
665 670 675

Pro Gly Phe Pro Gly Asn Ala Gly Val Pro Gly Thr Pro Gly Glu
680 685 690

Arg Gly Leu Thr Gly Ile Lys Gly Glu Lys Gly Asn Pro Gly Val
695 700 705

Gly Thr Gln Gly Pro Arg Gly Pro Pro Gly Pro Ala Gly Pro Ser
710 715 720

Gly Glu Ser Arg Pro Gly Ser Pro Gly Pro Pro Gly Ser Pro Gly
725 730 735

Pro Arg Gly Pro Pro Gly His Leu Gly Val Pro Gly Pro Gln Gly
740 745 750

Pro Ser Gly Gln Pro Gly Tyr Cys Asp Pro Ser Ser Cys Ser Ala
755 760 765

Tyr Gly Val Arg Ala Pro His Pro Asp Gln Pro Glu Phe Thr Pro
770 775 780

Val Gln Asp Glu Leu Glu Ala Met Glu Leu Trp Gly Pro Gly Val
785 790 795

<210> 8 

<211> 306 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<400> 8

Met Arg Ile Trp Trp Leu Leu Leu Ala Ile Glu Ile Cys Thr Gly
1 5 10 15

Asn Ile Asn Ser Gln Asp Thr Cys Arg Gln Gly His Pro Gly Ile
20 25 30

Pro Gly Asn Pro Gly His Asn Gly Leu Pro Gly Arg Asp Gly Arg
35 40 45

Asp Gly Ala Lys Gly Asp Lys Gly Asp Ala Gly Glu Pro Gly Arg
50 55 60

Pro Gly Ser Pro Gly Lys Asp Gly Thr Ser Gly Glu Lys Gly Glu
65 70 75

Arg Gly Ala Asp Gly Lys Val Glu Ala Lys Gly Ile Lys Gly Asd
80 85 90

Gln Gly Ser Arg Gly Ser Pro Gly Lys His Gly Pro Lys Gly Leu
95 100 105

Ala Gly Pro Met Gly Glu Lys Gly Leu Arg Gly Glu Thr Gly Pro
110 115 120

Gln Gly Gln Lys Gly Asn Lys Gly Asp Val Gly Pro Thr Gly Pro
125 130 135

Glu Gly Pro Arg Gly Asn Ile Gly Pro Leu Gly Pro Thr Gly Leu
140 145 150

Pro Gly Pro Met Gly Pro Ile Gly Lys Pro Gly Pro Lys Gly Glu
155 160 165

Ala Gly Pro Thr Gly Pro Gln Gly Glu Pro Gly Val Arg Gly Ile
170 175 180

Arg Gly Trp Lys Gly Asp Arg Gly Glu Lys Gly Lys Ile Gly Glu
185 190 195

Thr Leu Val Leu Pro Lys Ser Ala Phe Thr Val Gly Leu Thr Val
200 205 210

Leu Ser Lys Phe Pro Ser Ser Asp Val Pro Ile Lys Phe Asp Lys
215 220 225

Ile His Ile Thr Val Phe Ser Arg Asn Val Gln Val Ser Leu Val
230 235 240

Lys Asn Gly Val Lys Ile Leu His Thr Arg Asp Ala Tyr Val Ser
245 250 255

Ser Glu Asp Gln Ala Ser Gly Ser Ile Val Leu Gln Leu Lys Leu
260 265 270

Gly Asp Glu Met Trp Leu Gln Val Thr Gly Gly Glu Arg Phe Asn
275 280 285

Gly Leu Phe Ala Asp Glu Asp Asp Asp Thr Thr Phe Thr Gly Phe
290 295 300

Leu Leu Phe Ser Ser Gln
305

<210> 9 
<211> 338 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 2040679CD1 

<400> 9 

Met Tyr Val Leu Ser Pro Val Glu Phe Ile Ile Leu Gln Leu Leu
1 5 10 15

Phe Ile Gln Ala Ile Ser Ser Ser Leu Lys Gly Phe Leu Ser Ala
20 25 30

Met Arg Leu Ala His Arg Gly Cys Asn Val Asp Thr Pro Val Ser
35 40 45

Thr Leu Thr Pro Val Lys Thr Ser Glu Phe Glu Asn Phe Lys Thr
50 55 60

Lys Met Val Ile Thr Ser Lys Lys Asp Tyr Pro Leu Ser Lys Asn
65 70 75

Phe Pro Tyr Ser Leu Glu His Leu Gln Thr Ser Tyr Cys Gly Leu
80 85 90

Val Arg Val Asp Met Arg Met Leu Cys Leu Lys Ser Leu Arg Lys
95 100 105

Leu Asp Leu Ser His Asn His Ile Lys Lys Leu Pro Ala Thr Ile
110 115 120

Gly Asp Leu Ile His Leu Gln Glu Leu Asn Leu Asn Asp Asn His
125 130 135

Leu Glu Ser Phe III Val Ala Leu Cys His Ser Thr Leu Gln Lys
140 145 150

Ser Leu Arg Ser Leu Asp Leu Ser Lys Asn Lys Ile Lys Ala Leu
155 160 165

Pro Val Gln Phe Cys Gln Leu Gln Glu Leu Lys Asn Leu Lys Leu
170 175 180

Asp Asp Asn Glu III Ile Gln Phe Pro Cys Lys Ile Gly Gln
185 190 195

Ile Asn Leu Arg Phe Leu Ser Ala Ala Arg Asn Lys Leu Pro Phe
200 205 210

Leu Pro Ser Glu Phe Arg Asn Leu Ser Leu Glu Tyr Leu Asp Leu
215 220 225

Phe Gly Asn Thr Ihl Glu Gln Pro Lys Val Leu Pro Val Ile Lys
230 235 240

Leu Gln Ala Pro III Thr Leu Leu Glu Ser Ser Ala Arg Thr Ile
245 250 255

Leu His Asn Arg Ue Pro Tyr Gly Ser His Ile Ile Pro Phe His
260 265 270

Leu Cys Gln Asp Leu Asp Thr Ala Lys Ile Cys Val Cys Gly Arg
275 280 285

Phe Cys Leu Asn III Phe Ile Gln Gly Thr Thr Thr Met Asn Leu
290 295 300

His Ser Val Ala His Thr Val Val Leu Val Asp Asn Leu Gly Gly
305 310 315

Thr Glu Ala Pro Ile Ile Ser Tyr Phe Cys Ser Leu Gly Cys Tyr
320 325 330

Val Asn Ser Ser Asp Met Leu Lys
335

<210> 10 

<211> 164

<212> PRT
<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 2960051CD1

<400> 10

Met Lys Ile Ala Val Leu Phe Cys Phe Phe Leu Leu Ile Ile Phe
1 5 10 15

Gln Thr Asp Phe Gly Lys Asn Glu Glu Ile Pro Arg Lys Gln Arg
20 25 30

Arg Lys Ile Tyr His Arg Arg Leu Arg Lys Ser Ser Thr Ser His
35 40 45

Lys His Arg Ser Asn Arg Gln Leu Gly Ile Pro Gln Thr Thr Val
50 55 60

Phe Thr Pro Val Ala Arg Leu Pro Ile Val Asn Phe Asp Tyr Ser
65 70 75

Met Glu Glu Lys Phe Glu Ser Phe Ser Ser Phe Pro Gly Val Glu
80 85 90

Ser Ser Tyr Asn vll Leu Pro Gly Lys Lys Gly His Cys Leu Val
95 100 105

Lys Gly Ile Thr Met Tyr Asn Lys Ala Val Trp Ser Pro Glu Pro
110 115 120

Cys Thr Thr Cys llu Cys Ser Asp Gly Arg Val Leu Cys Asp Glu
125 130 135

Thr Met Cys His Pro Gln Arg Cys Pro Gln Thr Val Ile Pro Glu
140 145 150

Gly Glu <?ys Cys Pro Val Cys Ser Ala Thr Gly Thr Glu Ile
155 160



<210> 11
<211> 327
<212> PRT
<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 3117318CD1

<400> 11

Met Arg Ala Leu Pro Gly Leu Leu Glu Ala Arg Ala Arg Thr Pro
1 5 10 15

Arg Leu Leu Leu Leu Gln Cys Leu Leu Ala Ala Ala Arg Pro Ser
20 25 30

Ser Ala Asp Gly Ser Ala Pro Asp Ser Ala Phe Thr Ser Pro Pro
35 40 45

Leu Arg Glu Glu Ile Met Ala Asn Asn Phe Ser Leu Glu Ser His
50 55 60

Asn Ile Ser Leu Thr Glu His Ser Ser Met Pro Val Glu Lys Asn
65 70 75

Ile Thr Leu Glu Arg Pro Ser Asn Val Asn Leu Thr Cys Gln Phe
80 85 90

Thr Thr Ser Gly Asp Leu Asn Ala Val Asn Val Thr Trp Lys Lys
95 100 105

Asp Gly Glu Gln Leu Glu Asn Asn Tyr Leu Val Ser Ala Thr Gly
110 115 120

Ser Thr Leu Tyr Thr Gln Tyr Arg Phe Thr Ile Ile Asn Ser Lys
125 130 135

Gln Met Gly Ser Tyr Ser Cys Phe Phe Arg Glu Glu Lys Glu Gln
140 145 150

Arg Gly Thr Phe Asn Phe Lys Val Pro Glu Leu His Gly Lys Asn
155 160 165

Lys Pro Leu Ile Ser Tyr Val Gly Asp Ser Thr Val Leu Thr Cys
170 175 180

Lys Cys Gln Asn Cys Phe Pro Leu Asn Trp Thr Trp Tyr Ser Ser
185 190 195

Asn Gly Ser Val Lys Val Pro Val Gly Val Gln Met Asn Lys Tyr
200 205 210

Val Ile Asn Gly Thr Tyr Ala Asn Glu Thr Lys Leu Lys Ile Thr
215 220 225

Gln Leu Leu Glu Glu Asp Gly Glu Ser Tyr Trp Cys Arg Ala Leu
230 235 240

Phe Gln Leu Gly Glu Ser Glu Glu His Ile Glu Leu Val Val Leu
245 250 255

Ser Tyr Leu Val Pro Leu Lys Pro Phe Leu Val Ile Val Ala Glu
260 265 270

Val Ile Leu Leu Val Ala Thr Ile Leu Leu Cys Glu Lys Tyr Thr
275 280 285

Gln Lys Lys Lys Lys His Ser Asp Glu Gly Lys Glu Phe Glu Gln
290 295 300

Ile Glu Gln Leu Lys Ser Asp Asp Ser Asn Gly Ile Glu Asn Asn
305 310 315

Val Pro Arg His Arg Lys Asn Glu Ser Leu Gly Gln
320 325

<210> 12 

<211> 716 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3486992CD1 

<400> 12 

Met Ala Arg Met Ser Phe Val Ile Ala Ala Cys Gln Leu Val Leu
1 5 10 15

Gly Leu Leu Met Thr Ser Leu Thr Glu Ser Ser Ile Gln Asn Ser
20 25 30

Glu Cys Pro Gln Leu Cys Val Cys Glu Ile Arg Pro Trp Phe Thr
35 40 45

Pro Gln Ser Thr Tyr Arg Glu Ala Thr Thr Val Asp Cys Asn Asp
50 55 60

Leu Arg Leu Thr Arg Ile Pro Ser Asn Leu Ser Ser Asp Thr Gln
65 70 75

Val Leu Leu Leu Gln Ser Asn Asn Ile Ala Lys Thr Val Asp Glu
80 85 90

Leu Gln Gln Leu Phe Asn Leu Thr Glu Leu Asp Phe Ser Gln Asn
95 100 105

Asn Phe Thr Asn Ile Lys Glu Val Gly Leu Ala Asn Leu Thr Gln
110 115 120

Leu Thr Thr Leu His Leu Glu Glu Asn Gln Ile Thr Glu Met Thr
125 130 135

Asp Tyr Cys Leu Gln Asp Leu Ser Asn Leu Gln Glu Leu Tyr Ile
140 145 150

Asn His Asn Gln Ile Ser Thr Ile Ser Ala His Ala Phe Ala Gly
155 160 165

Leu Lys Asn Leu Leu Arg Leu His Leu Asn Ser Asn Lys Leu Lys
170 175 180

Val Ile Asp Ser Arg Trp Phe Asp Ser Thr Pro Asn Leu Glu Ile
185 190 195

Leu Met Ile Gly Glu Asn Pro Val Ile Gly Ile Leu Asp Met Asn
200 205 210

Phe Lys Pro Leu Ala Asn Leu Arg Ser Leu Val Leu Ala Gly Met
215 220 225

Tyr Leu Thr Asp III Pro Gly Asn Ala Leu Val Gly Leu Asp Ser
230 235 240

Leu Glu Ser Leu Ser Phe Tyr Asp Asn Lys Leu Val Lys Val Pro
245 250 255

Gln Leu Ala Leu Gln Lys Val Pro Asn Leu Lys Phe Leu Asp Leu
260 265 270

Asn Lys Asn Pro Ile His Lys Ile Gln Glu Gly Asp Phe Lys Asn
275 280 285

Met Leu Arg Leu Lys Glu Leu Gly Ile Asn Asn Met Gly Glu Leu
290 295 300

Val Ser Val Asp III Tyr Ala Leu Asp Asn Leu Pro Glu Leu Thr
305 310 315

Lys Leu Glu Ala Thr Asn Asn Pro Lys Leu Ser Tyr Ile His Arg
320 325 330

Leu Ala Phe Arg Ser Val Pro Ala Leu Glu Ser Leu Met Leu Asn
335 340 345

Asn Asn Ala Leu Asn Ala Ile Tyr Gln Lys Thr Val Glu Ser Leu
350 355 360

Pro Asn Leu Arg Glu Ile Ser Ile His Ser Asn Pro Leu Arg Cys
365 370 375

Asp Cys Val Ile III Trp Ile Asn Ser Asn Lys Thr Asn Ile Arg
380 385 390

Phe Met Glu Pro Leu Ser Met Phe Cys Ala Met Pro Pro Glu Tyr
395 400 405

Lys Gly His Gln Val Lys Glu Val Leu Ile Gln Asp Ser Ser Glu
410 415 420

Gln Cys Leu Pro Met Ile Ser His Asp Ser Phe Pro Asn Arg Leu
425 430 435

Asn Val Asp Ile Gly Thr Thr Val Phe Leu Asp Cys Arg Ala Met
440 445 450

Ala Glu Pro Glu Pro Glu Ile Tyr Trp Val Thr Pro Ile Gly Asn
455 460 465

Lys Ile Thr Val Glu Thr Leu Ser Asp Lys Tyr Lys Leu Ser Ser
470 475 480

Glu Gly Thr Leu Glu Ile Ser Asn Ile Gln Ile Glu Asp Ser Gly
485 490 495

Arg Tyr Thr Cys Val Ala Gln Asn Val Gln Gly Ala Asp Thr Arg
500 505 510

Val Ala Thr Ile Lys Val Asn Gly Thr Leu Leu Asp Gly Thr Gln
515 520 525

Val Leu Lys Ile Tyr Val Lys Gln Thr Glu Ser His Ser Ile Leu
530 535 540

Val Ser Trp Lys Val Asn Ser Asn Val Met Thr Ser Asn Leu Lys
545 550 555

Trp Ser Ser Ala Thr Met Lys Ile Asp Asn Pro His Ile Thr Tyr
560 565 570

Thr Ala Arg Val Pro Val Asp Val His Glu Tyr Asn Leu Thr His
575 580 585

Leu Gln Pro Ser Thr Asp Tyr Glu Val Cys Leu Thr Val Ser Asn
590 595 600

Ile His Gln Gln Thr Gln Lys Ser Cys Val Asn Val Thr Thr Lys
605 610 615

Asn Ala Ala Phe Ala Val Asp Ile Ser Asp Gln Glu Thr Ser Thr
620 625 630

Ala Leu Ala Ala Val Met Gly Ser Met Phe Ala Val Ile Ser Leu
635 640 645

Ala Ser Ile Ala Val Tyr Phe Ala Lys Arg Phe Lys Arg Lys Asn
650 655 660

Tyr His His Ser Leu Lys Lys Tyr Met Gln Lys Thr Ser Ser Ile
665 670 675

Pro Leu Asn Glu Leu Tyr Pro Pro Leu Ile Asn Leu Trp Glu Gly
680 685 690

Asp Ser Glu Lys Asp Lys Asp Gly Ser Ala Asp Thr Lys Pro Thr
695 700 705

Gln Val Asp Thr Ser Arg Ser Tyr Tyr Met Trp
710 715

<210> 13 

<211> 665 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 4568384CD1

<400> 13

Met Val Leu Val Phe His Lys Gly Glu Leu Gly His Pro Leu Glu
1 5 10 15

Gln Ser Thr Asp Trp Pro Lys Ser Pro Lys Thr Pro Thr Gly Leu
20 25 30

Arg Arg Gly Arg Gln Cys Ile Arg Pro Ala Glu Ile Val Ala Ser
35 40 45

Leu Leu Glu Gly Glu Glu Asn Thr Cys Gly Lys Gln Lys Pro Lys
50 55 60

Glu Asn Asn Leu Lys Pro Lys Phe Gln Ala Phe Lys Gly Val Gly
65 70 75

Cys Leu Tyr Glu Lys Glu Ser Met Lys Lys Ser Leu Lys Asp Ser
80 85 90

Val Ala Ser Asn Asn Lys Asp Gln Asn Ser Met Lys His Glu Asp
95 100 105

Pro Ser Ile Ile Ser Met Glu Asp Gly Ser Pro Tyr Val Asn Gly
110 115 120

Ser Leu Gly Glu Val Thr Pro Cys Gln His Ala Lys Lys Ala Asn
125 130 135

Gly Pro Asn Tyr Ile Gln Pro Gln Lys Arg Gln Thr Thr Phe Glu
140 145 150

Ser Gln Asp Arg Lys Ala Val Ser Pro Ser Ser Ser Glu Lys Arg
155 160 165

Ser Lys Asn Pro Ile Ser Arg Pro Leu Glu Gly Lys Lys Ser Leu
170 175 180

Ser Leu Ser Ala Lys Thr His Asn Ile Gly Phe Asp Lys Asp Ser
185 190 195

Cys His Ser Thr Thr Lys Thr Glu Ala Ser Gln Glu Glu Arg Ser
200 205 210

Asp Ser Ser Gly Leu Thr Ser Leu Lys Lys Ser Pro Lys Val Ser
215 220 225

Ser Lys Asp Thr Arg Glu Ile Lys Thr Asp Phe Ser Leu Ser Ile
230 235 240

Ser Asn Ser Ser Asp Val Ser Ala Lys Asp Lys His Ala Glu Asp

Asn Glu Lys Arg Leu Ala Ala Leu Glu Ala Arg Gln Lys Ala Lys
260 265 270

Glu Val Gln Lys Lys Leu Val His Asn Ala Leu Ala Asn Leu Asp

Gly His Pro Glu Asp Lys Pro Thr His Ile Ile Phe Gly Ser Asp
290 295 300

Ser Glu Cys Glu Thr Glu Glu Thr Ser Thr Gln Glu Gln Ser His
305 310 315

Pro Gly Glu Glu Trp Val Lys Glu Ser Met Gly Lys Thr Ser Gly

Lys Leu Phe Asp Ser Ser Asp Asp Asp Glu Ser Asp Ser Glu Asp

Asp Ser Asn Arg Phe Lys Ile Lys Pro Gln Phe Glu Gly Arg Ala

Gly Gln Lys Leu Met Asp Leu Gln Ser His Phe Gly Thr Asp Asp
365 370 375

Arg Phe Arg Met Asp Ser Arg Phe Leu Glu Thr Asp Ser Glu Glu

Glu Gln Glu Glu Val Asn Glu Lys Lys Thr Ala Glu Glu Glu Glu
395 400 405

Leu Ala Glu Glu Lys Lys Lys Ala Leu Asn Val Val Gln Ser Val

Leu Gln Ile Asn Gln Ser Asn Ser Thr Asn Arg Gly Ser Val Ala

Ala Lys Lys Phe Lys Asp Ile Ile His Tyr Asp Pro Thr Lys Gln

Asp His Ala Thr Tyr Glu Arg Lys Arg Asp Asp Lys Pro Lys Glu
455 460 465

Ser Lys Ala Lys Arg Lys Lys Lys Arg Glu Glu Ala Glu Lys Leu
470 475 480

Pro Glu Val Ser Lys Glu Met Tyr Tyr Asn Ile Ala Met Asp Leu

Lys Glu Ile Phe Gln Thr Thr Lys Tyr Thr Ser Glu Lys Glu Glu
500 505 510

Gly Thr Pro Trp Asn Glu Asp Cys Gly Lys Glu Lys Pro Glu Glu

Ile Gln Asp Pro 111 Ala Leu Thr Ser Asp Ala Glu Gln Pro Ser
530 535 540

Gly Phe Thr Phe Ser Phe Phe Asp Ser Asp Thr Lys Asp Ile Lys
545 550 555

Glu Glu Thr Tyr Arg Val Glu Thr Val Lys Pro Gly Lys Ile Val
560 565 570

Trp Gln Glu Asp Pro Arg Leu Gln Asp Ser Ser Ser Glu Glu Glu
575 580 585

Asp Val Thr Glu Glu Thr Asp His Arg Asn Ser Ser Pro Gly Glu

Ala Ser Leu Leu Glu Lys Glu Thr Thr Arg Phe Phe Phe Phe Ser

Lys Asn Asp Glu Arg Leu Gln Gly Ser Asp Leu Phe Trp Arg Gly

Val Gly Ser Asn Met Ser Arg Asn Ser Trp Glu Ala Arg Thr Thr

Asn Leu Arg Met Asp Cys Arg Lys Lys His Lys Asp Ala Lys Arg
650 655 660

Lys Met Lys Pro Lys
665

<210> 14 
<211> 547 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 4586187CD1
<400> 14

Met Tyr Ser His Asn Val Val Ile Met Asn Leu Asn Asn Leu Asn
1 5 10 15

Leu Thr Gln Val Gln Gln Arg Asn Leu Ile Thr Asn Leu Gln Arg
20 25 30

Ser Val Asp Asp Thr Ser Gln Ala Ile Gln Arg Ile Lys Asn Asp
35 40 45

Phe Gln Asn Leu Gln Gln Val Phe Leu Gln Ala Lys Lys Asp Thr
50 55 60

Asp Trp Leu Lys Glu Lys Val Gln Ser Leu Gln Thr Leu Ala Ala
65 70 75

Asn Asn Ser Ala Leu Ala Lys Ala Asn Asn Asp Thr Leu Glu Asp
80 85 90

Met Asn Ser Gln Leu Asn Ser Phe Thr Gly Gln Met Glu Asn Ile
95 100 105

Thr Thr Ile Ser Gln Ala Asn Glu Gln Asn Leu Lys Asp Leu Gln
110 115 120

Asp Leu His Lys Asp Ala Glu Asn Arg Thr Ala Ile Lys Phe Asn
125 130 135

Gln Leu Glu Glu Arg Phe Gln Leu Phe Glu Thr Asp Ile Val Asn
140 145 150

Ile Ile Ser Asn Ile Ser Tyr Thr Ala His His Leu Arg Thr Leu
155 160 165

Thr Ser Asn Leu Asn Glu Val Arg Thr Thr Cys Thr Asp Thr Leu
170 175 180

Thr Lys His Thr Asp Asp Leu Thr Ser Leu Asn Asn Thr Leu Ala
185 190 195

Asn Ile Arg Leu Asp Ser Val Ser Leu Arg Met Gln Gln Asp Leu
200 205 210

Met Arg Ser Arg Leu Asp Thr Glu Val Ala Asn Leu Ser Val Ile
215 220 225

Met Glu Glu Met Lys Leu Val Asp Ser Lys His Gly Gln Leu Ile
230 235 240

Lys Asn Phe Thr Ile Leu Gln Gly Pro Pro Gly Pro Arg Gly Pro
245 250 255

Arg Gly Asp Arg Gly Ser Gln Gly Pro Pro Gly Pro Thr Gly Asn
260 265 270

Lys Gly Gln Lys Gly Glu Lys Gly Glu Pro Gly Pro Pro Gly Pro
275 280 285

Ala Gly Glu Arg Gly Pro Ile Gly Pro Ala Gly Pro Pro Gly Glu
290 295 300

Arg Gly Gly Lys Gly Ser Lys Gly Ser Gln Gly Pro Lys Gly Ser
305 310 315

Arg Gly Ser Pro Gly Lys Pro Gly Pro Gln Gly Pro Ser Gly Asp
320 325 330

Pro Gly Pro Pro Gly Pro Pro Gly Lys Glu Gly Leu Pro Gly Pro
335 340 345

Gln Gly Pro Pro Gly Phe Gln Gly Leu Gln Gly Thr Val Gly Glu
350 355 360

Pro Gly Val Pro Gly Pro Arg Gly Leu Pro Gly Leu Pro Gly Val
365 370 375

Pro Gly Met Pro Gly Pro Lys Gly Pro Pro Gly Pro Pro Gly Pro
380 385 390

Ser Gly Ala Val Val Pro Leu Ala Leu Gln Asn Glu Pro Thr Pro
395 400 405

Ala Pro Glu Asp Asn Ser Cys Pro Pro His Trp Lys Asn Phe Thr
410 415 420

Asp Lys Cys Tyr Tyr Phe Ser Val Glu Lys Glu Ile Phe Glu Asp
425 430 435

Ala Lys Leu Phe Cys Glu Asp Lys Ser Ser His Leu Val Phe Ile
440 445 450

Asn Thr Arg Glu Glu Gln Gln Trp Ile Lys Lys Gln Met Val Gly
455 460 465

Arg Glu Ser His Trp Ile Gly Leu Thr Asp Ser Glu Arg Glu Asn
470 475 480

Glu Trp Lys Trp Leu Asp Gly Thr Ser Pro Asp Tyr Lys Asn Trp
485 490 495

Lys Ala Gly Gln Pro Asp Asn Trp Gly His Gly His Gly Pro Gly
500 505 510

Glu Asp Cys Ala Gly Leu Ile Tyr Ala Gly Gln Trp Asn Asp Phe
515 520 525

Gln Cys Glu Asp Val Asn Asn Phe Ile Cys Glu Lys Asp Arg Glu
530 535 540

Thr Val Leu Ser Ser Ala Leu
545

<210> 15 

<211> 109 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 401801CD1

<400> 15

Met Tyr Phe Asn Leu Gln Glu Asn Ile [residues 10-15 illegible]
1 5 10 15

Ile Glu Thr Asn Asp Gly Asn Val Thr Asp Glu Leu Trp Val Phe
20 25 30

Asn Ile His Ser Gln Ser Trp Ser Thr Lys Thr Pro Thr Val Leu
35 40 45

Gly His Gly Gln Gln Tyr Ala Val Glu Gly His Ser Ala His Ile
50 55 60

Met Glu Leu Asp Ser Arg Asp Val Val Met Ile Ile Ile Phe Gly
65 70 75

Tyr Ser Ala Ile Tyr Gly Tyr Thr Ser Ser Ile Gln Glu Tyr His
80 85 90

Ile Cys Glu Leu Leu Lys Asn Cys Asn Phe Phe Ile Asp Trp Glu
95 100 105

Cys Phe Ser Leu

<210> 16
<211> 192
<212> PRT

<213> Homo sapiens
<220>

<221> misc_feature
<223> Incyte ID No: 1721842CD1

<400> 16

Met Asn Lys Arg Asp Tyr Met Asn Thr Ser Val Gln Glu Pro Pro
1 5 10 15

Leu Asp Tyr Ser Phe Arg Ser Ile His Val Ile Gln Asp Leu Val
20 25 30

Asn Glu Glu Pro Arg Thr Gly Leu Arg Pro Leu Lys Arg Ser Lys
35 40 45

Ser Gly Lys Ser Leu Thr Gln Ser Leu Trp Leu Asn Asn Asn Val
50 55 60

Leu Asn Asp Leu Arg Asp Phe Asn Gln Val Ala Ser Gln Leu Leu
65 70 75

Glu His Pro Glu Asn Leu Ala Trp Ile Asp Leu Ser Phe Asn Asp
80 85 90

Leu Thr Ser Ile Asp Pro Val Leu Thr Thr Phe Phe Asn Leu Ser
95 100 105

Val Leu Tyr Leu His Gly Asn Ser Ile Gln Arg Leu Gly Glu Val
110 115 120

Asn Lys Leu Ala Val Leu Pro Arg Leu Arg Ser Leu Thr Leu His
125 130 135

Gly Asn Pro Met Glu Glu Glu Lys Gly Tyr Arg Gln Tyr Val Leu
140 145 150

Cys Thr Leu Ser Arg Ile Thr Thr Phe Asp Phe Ser Gly Val Thr
155 160 165

Lys Ala Asp Arg Thr Thr Ala Glu Val Trp Lys Arg Met Asn Ile
170 175 180

Lys Pro Lys Lys Ala Trp Thr Lys Gln Asn Thr Leu
185 190

<210> 17 

<211> 575 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 1833221CD1

<400> 17

Met Val Leu Gly Ser Phe Gly Thr Asp Leu Met Arg Glu Arg Arg
1 5 10 15

Asp Leu Glu Arg Arg Thr Asp Ser Ser Ile Ser Asn Leu Met Asp
20 25 30

Tyr Ser His Arg Ser Gly Asp Phe Thr Thr Ser Ser Tyr Val Gln
35 40 45

Asp Arg Val Pro Ser Tyr Ser Gln Gly Ala Arg Pro Lys Glu Asn
50 55 60

Ser Met Ser Thr Leu Gln Leu Asn Thr Ser Ser Thr Asn His Gln
65 70 75

Leu Pro Ser Glu His Gln Thr Ile Leu Ser Ser Arg Asp Ser Arg
80 85 90

Asn Ser Leu Arg Ser Asn Phe Ser Ser Arg Glu Ser Glu Ser Ser
95 100 105

Arg Ser Asn Thr Gln Pro Gly Phe Ser Tyr Ser Ser Ser Arg Asp
110 115 120

Glu Ala Pro Ile Ile Ser Asn Ser Glu Arg Val Val Ser Ser Gln
125 130 135

Arg Pro Phe Gln Glu Ser Ser Asp Asn Glu Gly Arg Arg Thr Thr
140 145 150

Arg Arg Leu Leu Ser Arg Ile Ala Ser Ser Met Ser Ser Thr Phe
155 160 165

Phe Ser Arg Arg Ser Ser Gln Asp Ser Leu Asn Thr Arg Ser Leu
170 175 180

Asn Ser Glu Asn Ser Tyr Val Ser Pro Arg Ile Leu Thr Ala Ser
185 190 195

Gln Ser Arg Ser Asn Val Pro Ser Ala Ser Glu Val Pro Asp Asn
200 205 210

Arg Ala Ser Glu Ala Ser Gln Gly Phe Arg Phe Leu Arg Arg Arg
215 220 225

Trp Gly Leu Ser Ser Leu Ser His Asn His Ser Ser Glu Ser Asp
230 235 240

Ser Glu Asn Phe Asn Gln Glu Ser Glu Gly Arg Asn Thr Gly Pro
245 250 255

Trp Leu Ser Ser Ser Leu Arg Asn Arg Cys Thr Pro Leu Phe Ser
260 265 270

Arg Arg Arg Arg Glu Gly Arg Asp Glu Ser Ser Arg Ile Pro Thr
275 280 285

Ser Asp Thr Ser Ser Arg Ser His Ile Phe Arg Arg Glu Ser Asn
290 295 300

Glu Val Val His Leu Glu Ala Gln Asn Asp Pro Leu Gly Ala Ala
305 310 315

Ala Asn Arg Pro Gln Ala Ser Ala Ala Ser Ser Ser Ala Thr Thr
320 325 330

Gly Gly Ser Thr Ser Asp Ser Ala Gln Gly Gly Arg Asn Thr Gly
335 340 345

Ile Ser Gly Ile Leu Pro Gly Ser Leu Phe Arg Phe Ala Val Pro
350 355 360

Pro Ala Leu Gly Ser Asn Leu Thr Asp Asn Val Met Ile Thr Val
365 370 375

Asp Ile Ile Pro Ser Gly Trp Asn Ser Ala Asp Gly Lys Ser Asp
380 385 390

Lys Thr Lys Ser Ala Pro Ser Arg Asp Pro Glu Arg Leu Gln Lys
395 400 405

Ile Lys Glu Ser Leu Leu Leu Glu Asp Ser Glu Glu Glu Glu Gly
410 415 420

Asp Leu Cys Arg Ile Cys Gln Met Ala Ala Ala Ser Ser Ser Asn
425 430 435

Leu Leu Ile Glu Pro Cys Lys Cys Thr Gly Ser Leu Gln Tyr Val
440 445 450

His Gln Asp Cys Met Lys Lys Trp Leu Gln Ala Lys Ile Asn Ser
455 460 465

Gly Ser Ser Leu Glu Ala Val Thr Thr Cys Glu Leu Cys Lys Glu
470 475 480

Lys Leu Glu Leu Asn Leu Glu Asp Phe Asp Ile His Glu Leu His
485 490 495

Arg Ala His Ala Asn Glu Gln Ala Glu Tyr Glu Phe Ile Ser Ser
500 505 510

Gly Leu Tyr Leu Val Val Leu Leu His Leu Cys Glu Gln Ser Phe
515 520 525

Ser Asp Met Met Gly Asn Thr Asn Glu Pro Ser Thr Arg Val Arg
530 535 540

Phe Ile Asn Leu Ala Arg Thr Leu Gln Ala His Met Glu Asp Leu
545 550 555

Glu Thr Ser Glu Asp Asp Ser Glu Glu Asp Gly Asp His Asn Arg
560 565 570

Thr Phe Asp Ile Ala
575

<210> 18
<211> 342
<212> PRT
<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 2041168CD1

<400> 18

Met Ala Glu Gly Gly Ser Gly Asp Val Asp Asp Ala Gly Asp Cys
1 5 10 15

Ser Gly Ala Arg Tyr Asn Asp Trp Ser Asp Asp Asp Asp Asp Ser
20 25 30

Asn Glu Ser Lys Ser Ile Val Trp Tyr Pro Pro Trp Ala Arg Ile
35 40 45

Gly Thr Glu Ala Gly Thr Arg Ala Arg Ala Arg Ala Arg Ala Arg
50 55 60

Ala Thr Arg Ala Arg Arg Ala Val Gln Lys Arg Ala Ser Pro Asn
65 70 75

Ser Asp Asp Thr Val Leu Ser Pro Gln Glu Leu Gln Lys Val Leu
80 85 90

Cys Leu Val Glu Met Ser Glu Lys Pro Tyr Ile Leu Glu Ala Ala
95 100 105

Leu Ile Ala Leu Gly Asn Asn Ala Ala Tyr Ala Phe Asn Arg Asp
110 115 120

Ile Ile Arg Asp Leu Gly Gly Leu Pro Ile Val Ala Lys Ile Leu
125 130 135

Asn Thr Arg Asp Pro Ile Val Lys Glu Lys Ala Leu Ile Val Leu
140 145 150

Asn Asn Leu Ser Val Asn Ala Glu Asn Gln Arg Arg Leu Lys Val
155 160 165

Tyr Met Asn Gln Val Cys Asp Asp Thr Ile Thr Ser Arg Leu Asn
170 175 180

Ser Ser Val Gln Leu Ala Gly Leu Arg Leu Leu Thr Asn Met Thr
185 190 195

Val Thr Asn Glu Tyr Gln His Met Leu Ala Asn Ser Ile Ser Asp
200 205 210

Phe Phe Arg Leu Phe Ser Ala Gly Asn Glu Glu Thr Lys Leu Gln
215 220 225

Val Leu Lys Leu Leu Leu Asn Leu Ala Glu Asn Pro Ala Met Thr
230 235 240

Arg Glu Leu Leu Arg Ala Gln Val Pro Ser Ser Leu Gly Ser Leu
245 250 255

Phe Asn Lys Lys Glu Asn Lys Glu Val Ile Leu Lys Leu Leu Val
260 265 270

Ile Phe Glu Asn Ile Asn Asp Asn Phe Lys Trp Glu Glu Asn Glu
275 280 285

Pro Thr Gln Asn Gln Phe Gly Glu Gly Ser Leu Phe Phe Phe Leu
290 295 300

Lys Glu Phe Gln Val Cys Ala Asp Lys Val Leu Gly Ile Glu Ser
305 310 315

His His Asp Phe Leu Val Lys Val Lys Val Gly Lys Phe Met Ala
320 325 330

Lys Leu Ala Glu His Met Phe Pro Lys Ser Gln Glu
335 340

<210> 19 
<211> 110 

<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 2365794CD1

<400> 19

Met Ala Ala Val Val Ala Lys Arg Glu Gly Pro Pro Phe Ile Ser

Glu Ala Ala Val Arg Gly Asn Ala Ala Val Leu Asp Tyr Cys Arg
20 25 30

Thr Ser Val Ser Ala Leu Ser Gly Ala Thr Ala Gly Ile Leu Gly
35 40 45

Leu Thr Gly Leu Tyr Gly Phe Ile Phe Tyr Leu Leu Ala Ser Val
50 55 60

Leu Leu Ser Leu Leu Leu Ile Leu Lys Ala Gly Arg Arg Trp Asn
65 70 75

Lys Tyr Phe Lys Ser Arg Arg Pro Leu Phe Thr Gly Gly Leu Ile
80 85 90

Gly Gly Leu Phe Thr Tyr Val Leu Phe Trp Thr Phe Leu Tyr Gly
95 100 105

Met Val His Val Tyr
110

<210> 20 

<211> 571 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2618452CD1 

<400> 20 

Met Pro Thr Gly Thr Ile Pro Pro Pro Thr Thr Leu Lys Ala Thr
1 5 10 15

Gly Ser Thr His Thr Ala Pro Pro Met Met Pro Thr Thr Ser Gly
20 25 30

Thr Ser Gln Ala Ser Ser Ser Phe Asn Thr Ala Lys Thr Ser Thr
35 40 45

Ser Leu His Ser His Thr Ser Ser Thr His His Pro Glu Val Thr
50 55 60

Pro Thr Ser Ile Thr Asn Ile Thr Leu Asn Pro Thr Ser Ile Gly

Thr Trp Thr Pro Val Ala His Thr Thr Ser Ala Thr Ser Ser Aw
80 85 90

Leu Thr Thr Pro Phe Thr Thr His Ser Pro Pro Thr Gly Ser Ser
95 100 105

Pro Ile Ser Ser Thr Gly Pro Met Thr Ala Thr Ser Phe Gln Thr

Thr Thr Tyr Tyr Thr Pro Pro Ser His Pro Gln Thr Thr Leu Pro
125 130 135

Thr His Val Pro Pro Phe Ser Thr Ser Leu Val Thr Pro Ser Thr
140 145 150

His Thr Val Ile Ile Thr Thr His Thr Gln Met Ala Thr Ser Ala
155 160 165

Ser Ile His Ser Thr Pro Thr Gly Thr Val Pro Pro Pro Thr Thr
170 175 180

Leu Lys Ala Thr Gly Ser Thr His Thr Ala Pro Pro Met Thr Val
185 190 195

Thr Thr Ser Gly Thr Ser Gln Thr His Ser Ser Phe Ser Thr Ala
200 205 210

Thr Ala Ser Ser Ser Phe Ile Ser Ser Ser Ser Trp Ser Ser Trp
215 220 225

Leu Pro Gln Asn Ser Ser Ser Arg Pro Pro Ser Ser Pro Ile Thr
230 235 240

Thr Gln Leu Pro His Leu Ser Ser Ala Thr Thr Pro Val Ser Thr
245 250 255

Thr Asn Gln Leu Ser Ser Ser Phe Ser Pro Ser Pro Ser Ala Pro
260 265 270

Ser Thr Val Ser Ser Tyr Val Pro Ser Ser His Ser Ser Pro Gln
275 280 285

Thr Ser Ser Pro Ser Val Gly Thr Ser Ser Ser Phe Val Ser Ala
290 295 300

Pro Val His Ser Thr Thr Leu Ser Ser Gly Ser His Ser Ser Leu
305 310 315

Ser Thr His Pro Thr Thr Ala Ser Val Ser Ala Ser Pro Leu Phe
320 325 330

Pro Ser Ser Pro Ala Ala Ser Thr Thr Ile Arg Ala Thr Leu Pro
335 340 345

His Thr Ile Ser Ser Pro Phe Thr Leu Ser Ala Leu Leu Pro Ile
350 355 360

Ser Thr Val Thr Val Ser Pro Thr Pro Ser Ser His Leu Ala Ser
365 370 375

Ser Thr Ile Ala Phe Pro Ser Thr Pro Arg Thr Thr Ala Ser Thr
380 385 390

His Thr Ala Pro Ala Phe Ser Ser Gln Ser Thr Thr Ser Arg Ser
395 400 405

Thr Ser Leu Thr Thr Arg Val Pro Thr Ser Gly Phe Val Ser Leu
410 415 420

Thr Ser Gly Val Thr Gly Ile Pro Thr Ser Pro Val Thr Asn Leu
425 430 435

Thr Thr Arg His Pro Gly Pro Thr Leu Ser Pro Thr Thr Arg Phe
440 445 450

Leu Thr Ser Ser Leu Thr Ala His Gly Ser Thr Pro Ala Ser Ala
455 460 465

Pro Val Ser Ser Leu Gly Thr Pro Thr Pro Thr Ser Pro Gly Val
470 475 480

Cys Ser Val Arg Glu Gln Gln Glu Glu Ile Thr Phe Lys Gly Cys
485 490 495

Met Ala Asn Val Thr Val Thr Arg Cys Glu Gly Ala Cys Ile Ser
500 505 510

Ala Ala Ser Phe Asn Ile Ile Thr Gln Gln Val Asp Ala Arg Cys
515 520 525

Ser Cys Cys Arg Pro Leu His Ser Tyr Glu Gln Gln Leu Glu Leu
530 535 540

Pro Cys Pro Asp Pro Ser Thr Pro Gly Arg Arg Leu Val Leu Thr
545 550 555

Leu Gln Val Phe Ser His Cys Val Cys Ser Ser Val Ala Cys Gly
560 565 570

Asp

<210> 21 
<211> 262 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 2622288CD1
<400> 21

Met Val Ala Trp Arg Ser Ala Phe Leu Val Cys Leu Ala Phe Ser
1 5 10 15

Leu Ala Thr Leu Val Gln Arg Gly Ser Gly Asp Phe Asp Asp Phe
20 25 30

Asn Leu Glu Asp Ala Val Lys Glu Thr Ser Ser Val Lys Gln Pro
35 40 45

Trp Asp His Thr Thr Thr Thr Thr Thr Asn Arg Pro Gly Thr Thr
50 55 60

Arg Ala Pro Ala Lys Pro Pro Gly Ser Gly Leu Asp Leu Ala Asp
65 70 75

Ala Leu Asp Asp Gln Asp Asp Gly Arg Arg Lys Pro Gly Ile Gly
80 85 90

Gly Arg Glu Arg Trp Asn His Val Thr Thr Thr Thr Lys Arg Pro
95 100 105

Val Thr Thr Arg Ala Pro Ala Asn Thr Leu Gly Asn Asp Phe Asp
110 115 120

Leu Ala Asp Ala Leu Asp Asp Arg Asn Asp Arg Asp Asp Gly Arg
125 130 135

Arg Lys Pro Ile Ala Gly Gly Gly Gly Phe Ser Asp Lys Asp Leu
140 145 150

Glu Asp Ile Val Gly Gly Gly Glu Tyr Lys Pro Asp Lys Gly Lys
155 160 165

Gly Asp Gly Arg Tyr Gly Ser Asn Asp Asp Pro Gly Ser Gly Met
170 175 180

Val Ala Glu Pro Gly Thr Ile Ala Gly Val Ala Ser Ala Leu Ala
185 190 195

Met Ala Leu Ile Gly Ala Val Ser Ser Tyr Ile Ser Tyr Gln Gln
200 205 210

Lys Lys Phe Cys Phe Ser Ile Gln Gln Gly Leu Asn Ala Asp Tyr
215 220 225

Val Lys Gly Glu Asn Leu Glu Ala Val Val Cys Glu Glu Pro Gln
230 235 240

Val Lys Tyr Ser Thr Leu His Thr Gln Ser Ala Glu Pro Pro Pro
245 250 255

Pro Pro Glu Pro Ala Arg Ile
260

<210> 22 
<211> 172 

<212> PRT
<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 2806595CD1

<400> 22

Met Gly Leu Leu Leu Leu Val Pro Leu Leu Leu Leu Pro Gly Ser
1 5 10 15

Tyr Gly Leu Pro Phe Tyr Asn Gly Phe Tyr Tyr Ser Asn Ser Ala
20 25 30

Asn Asp Gln Asn Leu Gly Asn Gly His Gly Lys Asp Leu Leu Asn
35 40 45

Gly Val Lys Leu Val Val Glu Thr Pro Glu Glu Thr Leu Phe Thr
50 55 60

Tyr Gln Gly Ala Ser Val Ile Leu Pro Cys Arg Tyr Arg Tyr Glu
65 70 75

Pro Ala Leu Val Ser Pro Arg Arg Val Arg Val Lys Trp Trp Lys
80 85 90

Leu Ser Glu Asn Gly Ala Pro Glu Lys Asp Val Leu Val Ala Ile
95 100 105

Gly Leu Arg His Arg Ser Phe Gly Asp Tyr Gln Gly Arg Val His
110 115 120

Leu Arg Gln Asp Lys Glu His Asp Val Ser Leu Glu Ile Gln Asp
125 130 135

Leu Arg Leu Glu Asp Tyr Gly Arg Tyr [illegible] Cys Glu Val Ile Asp
140 145 150

Gly Leu Glu Asp Glu Ser Gly Leu Val Glu Leu Glu Leu Arg Gly
155 160 165

Glu Met Leu Thr Gly Thr Gly
170

<210> 23 

<211> 571 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 2850987CD1

<400> 23

Met Thr Arg Ala Gly Asp His Asn Arg Gln Arg Gly Cys Cys Gly

Ser Leu Ala Asp Tyr Leu Thr Ser Ala Lys Phe Leu Leu Tyr Leu

Gly His Ser Leu Ser Thr Trp Gly Asp Arg Met Trp His Phe Ala
35 40 45

Val Ser Val Phe Leu Val Glu Leu Tyr Gly Asn Ser Leu Leu Leu
50 55 60

Thr Ala Val Tyr Gly Leu Val Val Ala Gly Ser Val Leu Val Leu

Gly Ala Ile Ile Gly Asp Trp Val Asp Lys Asn Ala Arg Leu Lys

Val Ala Gln Thr Ser Leu Val Val Gln Asn Val Ser Val Ile Leu

Cys Gly Ile Ile Leu Met Met Val Phe Leu His Lys His Glu Leu

Leu Thr Met Tyr His Gly Trp Val Leu Thr Ser Cys Tyr Ile Leu

lie lie Thr He 111 Asn He Ala Asn Leu Ala Ser Thr Ala Thr 

Ala Ile Thr Ile GlS Arg Asp Trp Ile Val Val Val Ala Gly Glu

Asp Arg Ser Lys III Ala Asn Met Asn Ala Thr Ile Arg Arg Ile

Asp Gln Leu Thr Asn Ile Leu Ala Pro Met Ala Val Gly Gln Ile

Met Thr Phe Gly Ser Pro Val Ile Gly Cys Gly Phe Ile Ser Gly

Trp Asn Leu Val llr Met Cys Val Glu Tyr Val Leu Leu Trp Lys

Val Tyr Gln Lys Thr Pro Ala Leu Ala Val Lys Ala Gly Leu Lys

Glu Glu Glu Thr Glu Leu Lys Gln Leu Asn Leu His Lys Asp Thr

Glu Pro Lys Pro Leu Glu Gly Thr His Leu Met Gly Val Lys Asp

Ser Asn Ile His llu Leu Glu His Glu Gln Glu Pro Thr Cys Ala

Ser Gln Met Ala Glu Pro Phe Arg Thr Phe Arg Asp Gly Trp Val

Ser Tyr Tyr Asn Gln Pro Val Phe Leu Ala Gly Met Gly Leu Ala
305 310 315

Phe Leu Tyr Met Thr Val Leu Gly Phe Asp Cys Ile Thr Thr Gly

Tyr Ala Tyr Thr Gln Gly Leu Ser Gly Ser Ile Leu Ser Ile Leu

Met Gly Ala Ser III Ile Thr Gly Ile Met Gly Thr Val Ala Phe

Thr Trp Leu Arg Arg Lys Cys Gly Leu Val Arg Thr Gly Leu Ile

Ser Gly Leu Ala Gln Leu Ser Cys Leu Ile Leu Cys Val Ile Ser

Val Phe Met Pro Gly Ser Pro Leu Asp Leu Ser Val Ser Pro Phe
395 400 405

Glu Asp Ile Arg Ser Arg Phe Ile Gln Gly Glu Ser Ile Thr Pro
410 415 420

Thr Lys Ile Pro Glu Ile Thr Thr Glu Ile Tyr Met Ser Asn Gly
425 430 435

Ser Asn Ser Ala Asn Ile Val Pro Glu Thr Ser Pro Glu Ser Val
440 445 450

Pro Ile Ile Ser Val Ser Leu Leu Phe Ala Gly Val Ile Ala Ala
455 460 465

Arg Ile Gly Leu Trp Ser Phe Asp Leu Thr Val Thr Gln Leu Leu
470 475 480

Gln Glu Asn Val Ile Glu Ser Glu Arg Gly Ile Ile Asn Gly Val
485 490 495

Gln Asn Ser Met Asn Tyr Leu Leu Asp Leu Leu His Phe Ile Met
500 505 510

Val Ile Leu Ala Pro Asn Pro Glu Ala Phe Gly Leu Leu Val Leu
515 520 525

Ile Ser Val Ser Phe Val Ala Met Gly His Ile Met Tyr Phe Arg
530 535 540

Phe Ala Gln Asn Thr Leu Gly Asn Lys Leu Phe Ala Cys Gly Pro
545 550 555

Asp Ala Lys Glu Val Arg Lys Glu Asn Gln Ala Asn Thr Ser Val
560 565 570

Val

<210> 24 

<211> 455 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 3557211CD1

<400> 24

Met Asp Pro Thr Gly Asn Ser Ala Thr Pro Gln Ile Leu Glu Leu
1 5 10 15

Lys Trp Ser His Ile Glu Trp Ser Gln Thr Glu Tyr Ile Cys Glu
20 25 30

Asn Val Gly Leu Leu Pro Leu Glu Ile Ile Arg Arg Gly Tyr Ser
35 40 45

Met Asp Ser Ala Phe Val Gly Ile Lys Val Asn Gln Val Ser Ala
50 55 60

Ala Val Gly Lys Asp Phe Thr Val Ile Pro Ser Lys Leu Ile Gln
65 70 75

Phe Asp Pro Gly Met Ser Thr Lys Met Trp Asn Ile Ala Ile Thr
80 85 90

Tyr Asp Gly Leu Glu Glu Asp Asp Glu Val Phe Glu Val Ile Leu
95 100 105

Asn Ser Pro Val Asn Ala Val Leu Gly Thr Lys Thr Lys Ala Ala
110 115 120

Val Lys Ile Leu Asp Ser Lys Gly Gly Gln Cys His Pro Ser Tyr
125 130 135

Ser Ser Asn Gln Ser Lys His Ser Thr Trp Glu Lys Gly Ile Trp
140 145 150

His Leu Leu Pro Pro Gly Ser Ser Ser Ser Thr Thr Ser Gly Ser
155 160 165

Phe His Leu Glu Arg Arg Pro Leu Pro Ser Ser Met Gln Leu Ala
170 175 180

Val Ile Arg Gly Asp Thr Leu Arg Gly Phe Asp Ser Thr Asp Leu
185 190 195

Ser Gln Arg Lys Leu Arg Thr Arg Gly Asn Gly Lys Thr Val Arg
200 205 210

Pro Ser Ser Val Tyr Arg Asn Gly Thr Asp Ile Ile Tyr Asn Tyr
215 220 225

His Gly Ile Val Ser Leu Lys Leu Glu Asp Asp Ser Phe Pro Thr
230 235 240

His Lys Arg Lys Ala Lys Val Ser Ile Ile Ser Gln Pro Gln Lys
245 250 255

Thr Ile Lys Val Ala Glu Leu Pro Gln Ala Asp Lys Val Glu Ser
260 265 270

Thr Thr Asp Ser His Phe Pro Arg Gln Asp Gln Leu Pro Ser Phe
275 280 285

Pro Lys Asn Cys Thr Leu Glu Leu Lys Gly Leu Phe His Phe Glu
290 295 300

Glu Gly Ile Gln Lys Leu Tyr Gln Cys Asn Gly Ile Ala Trp Lys
305 310 315

Ala Trp Ser Pro Gln Thr Lys Asp Val Glu Asp Lys Ser Cys Pro
320 325 330

Ala Gly Trp His Gln His Ser Gly Tyr Cys His Ile Leu Ile Thr
335 340 345

Glu Gln Lys Gly Thr Trp Asn Ala Ala Ala Gln Ala Cys Arg Glu
350 355 360

Gln Tyr Leu Gly Asn Leu Val Thr Val Phe Ser Arg Gln His Met
365 370 375

Arg Trp Leu Trp Asp Ile Gly Gly Arg Lys Ser Phe Trp Ile Gly

Leu Asn Asp Gln Val His Ala Gly His Trp Glu Trp Ile Gly Gly
395 400 405

Glu Pro Val Ala Phe Thr Asn Gly Arg Arg Gly Pro Ser Pro Arg
410 415 420

Ser Lys Leu Gly Lys Ser Cys Val Leu Val Gln Arg Gln Gly Lys
425 430 435

Trp Gln Thr Lys Asp Cys Arg Arg Ala Lys Pro His Asn Tyr Val
440 445 450

Cys Ser Arg Lys Leu
455

<210> 25 
<211> 437 
<212> PRT 

<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte ID No: 4675668CD1

<400> 25

Met Pro Lys Phe Lys Ala Ala Arg Gly Val Gly Gly Gln Glu Lys
1 5 10 15

His Ala Pro Leu Ala Asp Gln Ile Leu Ala Gly Asn Ala Val Arg
20 25 30

Ala Gly Val Arg Glu Lys Arg Arg Gly Arg Gly Thr Gly Glu Ala
35 40 45

Glu Glu Glu Tyr Val Gly Pro Arg Leu Ser Arg Arg Ile Leu Gln
50 55 60

Gln Ala Arg Gln Gln Gln Glu Glu Leu Glu Ala Glu His Gly Thr
65 70 75

Gly Asp Lys Pro Ala Ala Pro Arg Glu Arg Thr Thr Arg Leu Gly
80 85 90

Pro Arg Met Pro Gln Asp Gly Ser Asp Asp Glu Asp Glu Glu Trp
95 100 105

Pro Thr Leu Glu Lys Ala Ala Thr Met Thr Ala Ala Gly His His
110 115 120

Ala Glu Val Val Val Asp Pro Glu Asp Glu Arg Ala Ile Glu Met
125 130 135

Phe Met Asn Lys Asn Pro Pro Ala Arg Arg Thr Leu Ala Asp Ile
140 145 150

Ile Met Glu Lys Leu Thr Glu Lys Gln Thr Glu Val Glu Thr Val
155 160 165

Met Ser Glu Val Ser Gly Phe Pro Met Pro Gln Leu Asp Pro Arg
170 175 180

Val Leu Glu Val Tyr Arg Gly Val Arg Glu Val Leu Ser Lys Tyr
185 190 195

Arg Ser Gly Lys Leu Pro Lys Ala Phe Lys Ile Ile Pro Ala Leu
200 205 210

Ser Asn Trp Glu Gln Ile Leu Tyr Val Thr Glu Pro Glu Ala Trp
215 220 225

Thr Ala Ala Ala Met Tyr Gln Ala Thr Arg Ile Phe Ala Ser Asn
230 235 240

Leu Lys Glu Arg Met Ala Gln Arg Phe Tyr Asn Leu Val Leu Leu
245 250 255

Pro Arg Val Arg Asp Asp Val Ala Glu Tyr Lys Arg Leu Asn Phe
260 265 270

His Leu Tyr Met Ala Leu Lys Lys Ala Leu Phe Lys Pro Gly Ala
275 280 285

Trp Phe Lys Gly Ile Leu Ile Pro Leu Cys Glu Ser Gly Thr Cys
290 295 300

Thr Leu Arg Glu Ala Ile Ile Val Gly Ser Ile Ile Thr Lys Cys
305 310 315

Ser Ile Pro Val Leu His Ser Ser Ala Ala Met Leu Lys Ile Ala
320 325 330

Glu Met Glu Tyr Ser Gly Ala Asn Ser Ile Phe Leu Arg Leu Leu
335 340 345

Leu Asp Lys Lys Tyr Ala Leu Pro Tyr Arg Val Leu Asp Ala Leu
350 355 360

Val Phe His Phe Leu Gly Phe Arg Thr Glu Lys Arg Glu Leu Pro
365 370 375

Val Leu Trp His Gln Cys Leu Leu Thr Leu Val Gln Arg Tyr Lys
380 385 390

Ala Asp Leu Ala Thr Asp Gln Lys Glu Ala Leu Leu Glu Leu Leu
395 400 405

Arg Leu Gln Pro His Pro Gln Leu Ser Pro Glu Ile Arg Arg Glu
410 415 420

Leu Gln Ser Ala Val Pro Arg Asp Val Glu Asp Val Pro Ile Thr
425 430 435

Val Glu

<210> 26 

<211> 2893 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 398269CB1 

<400> 26 

tcgggggcgg gctgggaggg ctgtcggtgg gccagtctgc gtacgacggc 60 
tofo^™ cgcacggacg ccgggaagaa gggggtgggg ccacgtttgc gtccgcgcca 12 0 

gata 9 cg ? cg aggtccgctt tcagtgtatg gttttccctg ccaaacggtt 180 
ctgcttggtg ccatccatgg agggcgtgcg ctgggccttt tcctgcggca cttggctgcc 240
foocclaftc 2SK?oc£ ^cagtgcg atcgattcag cccgfgglga aggfgcgclt III 
aUt^lll* gtcttt 9 c <^ pgacgctaa ggcagccatg gctggtcgtc tgatgataag 360 
gaaattagtt gcagagaaat tgaatatccc ttggaatcat attcgtttgc aaagaactgc 420 
lllallrllt ccagttctt ? caaaggactc atcgaatcct tacccgaaft tcalctttIS 480 

caaggagact atgcagtgct tgctgctgaa cctgagctgc aagttggaat 540 
olaaaoaaao aa ? aCtagtt "ccaggtcg tggttcaatt ccagaattct ttcataltat 6u0 

" ta ^ aa ^ a aa 9aatggga aacaatcaga agctttaagg atgagtggac 660 
tcagctggat atgttttata ggaattgggc acttaaggaa agcttcataa aagccattgg 720 
lllllllltl g ^ttgaat tgcagcggct tgaatttgat ctatctccat talac?tgg2 7io 

gtttataaag aaacacgttt attcctggat ggagaggaag aaaaagaatg 840 
aattnnSnf gaaa ^ aaaa tagatgagca ccattttgtt gcagttgctc ttaggaaacc 900 
cgatggatct agacatcagg atgttccatc tcaggatgat tccaaaccaa cccagaggca 960 

tcct?cat" ?J? at ^ aat gtCatct * cc gttcccatga cacctgEga 1020 

tgggactgtt tttgcttcac agaagaaatt ccaatacgaa atggtacaaa 1080 
gtcatgatga ttccctgagt aacaaaggga aatgaaaact gtttgtgatc ttccgtattc 1140 
aaa aft^ t9t tta ^ tatcaa ""tt.tttc acgafagttt ttttaaagat lloS 
^Stattta i^f55 aa f aaaaaaaa ? c agacttctgg ttcaagatag ctcactggaa 1260 
""^"^ cctctttctt tcctaaattg cattgaattg ataggaagga tggcggaatc 1320 
^fr aa fS a i ^5 aaC ^^9***** aatagaaaat gcacataagc aaaaggaaac 1380 

tttttfrnZZ tat ^ ttt, T aa agataactac tcttaaaacc ttgagtatct tttcagacct 1440 
ttttcttggc aaatgaatcc atattgacat atttgatttt tttaaaaaca tggaaacgta 1500
ctgttttgta aattcttttt aactgcacat ctactgttca taaatatacc tctgtaacat 1560
aactttttgt ggttctaatg tactctgttg tatagctaat acaagttagg atgcttttgg 1620 
ccagaggtaa cagtgtccaa atataattgg cctaagtaac ctaggaaatt gtttgacata 1680 
acacagggtt caggggtgtc attaaagaca cacttttttt gccttgacct cagttggttt 1740 
gttttgcctt aggctattcc acctctcgat cacaagagct gctgctataa cttccagttt 1800 
tacatctttg taaaattgta tctcaaaggc agaaaaggcc atttcgtctc tcatttgttt 1860
tatccatgag gaagattttt aacaaaagcc tccagaagat ttcccctcag tttccattga 1920 
cttagatcag gttacagaga aaggcaatgt ctgacatttt tggtctctgt tagaagtaga 1980
ctctgttgaa aagaaagaag ctaagctagg tgtgaagaat ggaattggaa gcccactgcc 2040 
ttcccataag aaaggtttac cataatttac tcactttttt ctgtgttaga cattttgatt 2100 
atctgcagtt tattactaca agcagtggca gagtgaatgt ccttgtacat tttgagttac 2160 
atgcttaatt atgtccttga gaaagtttct aaaagtggaa tgattggttt gactggttca 2220 
tagggcttta attatacaat ttacccctct aattagtact atatgtatgt gacttccctc 2280
cccctgccag aatactcctt ggtcaattgt aggtattctt tttggtttaa tttttgccaa 2340 
tgtaattaaa aaatggtatg tcatttttaa aatttgtatt tctttcatta caaataagat 2400 
tgttatgtca gtattgttat tggcttttcg tattcctctt aacgtgaacc gtctgttcat 2460 
tgtttttacc tgttttcgtt ttagcaagta gtacttaatt taaagtgtga acttaatata 2520
taagatgcca ggaccatcat attgatgaca aaaatctatt atgtggttgt agtgcatgtc 2580 
ttggagttaa actcttaagt atcaagaagt aaaataattt cattgtttct gttactaaca 2640 
ttaaaagtgt gcaaattgta taaaattagg tacttcgatt agtaaaagaa aaaaatttga 2700 
atgttttttt ttttatttta ttaagcatgc ccagttatgc caagcatagt aaataaaggt 2760
caagtagcat ttataataga ggaagtattg ttatccctag catgagtgta atggtgatat 2820 
gaaaaacttt gtcttgtcat tataataata aaaaaatgaa catttattat ggaatttcaa 2880 
aaaaaaaaaa aaa 

<210> 27 
<211> 2276 
<212> DNA 

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 1258888CB1 

<400> 27
gcgagtggag cggaggaccc gagcggctga ggagagagga ggcggcggct tagctgctac 60
ggggtccggc cggcgccctc ccgagggggg ctcaggagga ggaaggagga cccgtgcgag 120
aatgcctctg ccctggagcc ttgcgctccc gctgctgctc tcctgggtgg caggtggttt 180
cgggaacgcg gccagtgcaa ggcatcacgg gttgttagca tcggcacgtc agcctggggt 240
ctgtcactat ggaactaaac tggcctgctg ctacggctgg agaagaaaca gcaagggagt 300
ctgtgaagct acatgcgaac ctggatgtaa gtttggtgag tgcgtgggac caaacaaatg 360
cagatgcttt ccaggataca ccgggaaaac ctgcagtcaa gatgtgaatg agtgtggaat 420
gaaaccccgg ccatgccaac acagatgtgt gaatacacac ggaagctaca agtgcttttg 480
cctcagtggc cacatgctca tgccagatgc tacgtgtgtg aactctagga catgtgccat 540
gataaactgt cagtacagct gtgaagacac agaagaaggg ccacagtgcc tgtgtccatc 600
ctcaggactc cgcctggccc caaatggaag agactgtcta gatattgatg aatgtgcctc 660
tggtaaagtc atctgtccct acaatcgaag atgtgtgaac acatttggaa gctactactg 720
caaatgtcac attggtttcg aactgcaata tatcagtgga cgatatgact gtatagatat 780
aaatgaatgt actatggata gccatacgtg cagccaccat gccaattgct tcaataccca 840
agggtccttc aagtgtaaat gcaagcaggg atataaaggc aatggacttc ggtgttctgc 900
tatccctgaa aattctgtga aggaagtcct cagagcacct ggtaccatca aagacagaat 960
caagaagttg cttgctcaca aaaacagcat gaaaaagaag gcaaaaatta aaaatgttac 1020
cccagaaccc accaggactc ctacccctaa ggtgaacttg cagcccttca actatgaaga 1080
gatagtttcc agaggcggga actctcatgg aggtaaaaaa gggaatgaag agaaaatgaa 1140
agaggggctt gaggatgaga aaagagaaga gaaagccctg aagaatgaca tagaggagcg 1200
aagcctgcga ggagatgtgt ttttccctaa ggtgaatgaa gcaggtgaat tcggcctgat 1260
tctggtccaa aggaaagcgc taacttccaa actggaacat aaagcagatt taaatatctc 1320
ggttgactgc agcttcaatc atgggatctg tgactggaaa caggatagag aagatgattt 1380
tgactggaat cctgctgatc gagataatgc tattggcttc tatatggcag ttccggcctt 1440
ggcaggtcac aagaaagaca ttggccgatt gaaacttctc ctacctgacc tgcaacccca 1500
aagcaacttc tgtttgctct ttgattaccg gctggccgga gacaaagtcg ggaaacttcg 1560



dyiyLLuyty aauuuuuy ~ ^ => -< -■ - icon 

aaagtggaag acagggaaaa ttcagttgta tcaaggaact gatgctacca aaagcatcat 1680
ttttgaagca gaacgtggca agggcaaaac cggcgaaatc gcagtggatg gcgtcttgct 1740
tgtttcaggc ttatgtccag atagcctttt atctgtggat gactgaatgt tactatcttt 1800
atatttgact ttgtatgtca gttccctggt ttttttgata ttgcatcata ggacctctgg 1860
cattttagaa ttactagctg aaaaattgta atgtaccaac agaaatatta ttgtaagatg 1920
cctttcttgt ataagatatg ccaatatttg ctttaaatat catatcactg tatcttctca 1980
gtcatttctg aatctttcca cattatatta taaaatatgg aaatgtcagt ttatctcccc 2040
tcctcagtat atctgatttg tataagtaag ttgatgagct tctctctaca acatttctag 2100
aaaatagaaa aiaaaagcaca gagaaatgtt taactgtttg actcttatga tacttcttgg 2160
aaactatgac atcaaagata gacttttgcc taagtggctt agctgggtct ttcatagcca 2220
aacttgtata tttaaattct ttgtaataat aatatccaaa tcatcaaaaa aaaaaa 2276



<210> 28 

<211> 2016 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 1375891CB1 



<400> 28 

gaaaaggtac ccgcgagaga cagccagcag ttctgtggag cagcggtggc cggctaggat 60
gggctgtctc tggggtctgg ctctgcccct tttcttcttc tgctgggagg ttggggtctc 120
tgggagctct gcaggcccca gcacccgcag agcagacact gcgatgacaa cggacgacac 180
agaagtgccc gctatgactc tagcaccggg ccacgccgct ctggaaactc aaacgctgag 240
cgctgagacc tcttctaggg cctcaacccc agccggcccc attccagaag cagagaccag 300
gggagccaag agaatttccc ctgcaagaga gaccaggagt ttcacaaaaa catctcccaa 360
cttcatggtg ctgatcgcca cctccgtgga gacatcagcc gccagtggca gccccgaggg 420
agctggaatg accacagttc agaccatcac aggcagtgat cccgaggaag ccatctttga 480
caccctttgc accgatgaca gctctgaaga ggcaaagaca ctcacaatgg acatattgac 540
attggctcac acctccacag aagctaaggg cctgtcctca gagagcagtg cctcttccga 600
cggcccccat ccagtcatca ccccgtcacg ggcctcagag agcagcgcct cttccgacgg 660
cccccatcca gtcatcaccc cgtcacgggc ctcagagagc agcgcctctt ccgacggccc 720
ccatccagtc atcaccccgt catggtcccc gggatctgat gtcactctcc tcgctgaagc 780
cctggtgact gtcacaaaca tcgaggttat taattgcagc atcacagaaa tagaaacaac 840
aacttccagc atccctgggg cctcagacat agatctcatc cccacggaag gggtgaaggc 900
ctcgtccacc tccgatccac cagctctgcc tgactccact gaagcaaaac cacacatcac 960
tgaggtcaca gcctctgccg agaccctgtc cacagccggc accacagagt cagctgcacc 1020
tcatgccacg gttgggaccc cactccccac taacagtgcc acagaaagag aagtgagagc 1080
acccggggcc acgaccctca gtggagctct ggtcacagtt agcaggaatc ccctggaaga 1140
aacctcagcc ctctctgttg agacaccaag ttacgtcaaa gtctcaggag cagctccggt 1200
ctccatagag gctgggtcag cagtgggcaa aacaacttcc tttgctggga gctctgcttc 1260
ctcctacagc ccctcggaag ccgccctcaa gaacttcacc ccttcagaga caccgaccat 1320
ggacatcgca accaaggggc ccttccccac cagcagggac cctcttcctt ctgtccctcc 1380
gactacaacc aacagcagcc gagggacgaa cagcacctta gccaagatca caacctcagc 1440
gaagaccacg atgaagcccc aacagccacg cccacgactg cccggacgag gccgaccaca 1500
gacgtgagtg caggtgaaaa tggaggtttc ctcctcctgc ggctgagtgt ggcttccccg 1560
gaagacctca ctgaccccag agtggcagaa aggctgatgc agcagctcca ccgggaactc 1620
cacgcccacg cgcctcactt ccaggtctcc ttactgcgtg tcaggagagg ctaacggaca 1680
tcagctgcag ccaggcatgt cccgtatgcc aaaagagggt gctgccccta gcctgggccc 1740
ccaccgacag actgcagctg cgttactgtg ctgagaggta cccagaaggt tcccatgaag 1800
ggcagcatgt ccaagcccct gaccccagat gtggcaacag gaccctcgct cacatccacc 1860
ggagtgtatg tgtggggagg ggcttcacct gttcccagag gtgtccttgg actcaccttg 1920
gcacatgttc tgtgtttcag taaagagaga cctgatcacc catctgtgtg cttccatcct 1980
gcattaaaat tcactcagtg tggcccaaaa aaaaaa 2016



<210> 29 

<211> 2520 

<212> DNA 

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 1524355CB1 



<400> 29 

caagatggcg gctggcggag ctgtcgctgc ggcgcccgag tgccggcttc tcccctacgc 60
gctacacaag tggagctcct tttcctccac ctaccttccc gagaacattt tagtggacaa 120
accaaatgac caatcttcaa gatggtcttc agagagcaac tatcctcccc agtacttgat 180
tctaaagctc gaaaggcctg ctatagttca gaatatcaca tttggaaaat atgagaaaac 240
tcatgtttgc aatttgaaga aatttaaagt ctttggtgga atgaatgaag aaaatatgac 300
agagctgttg tccagtggct taaagaatga ttataacaaa gaaacattca ccttgaagca 360
taaaattgat gaacagatgt tcccttgtcg attcattaaa atagttccac tcttgtcctg 420
gggacccagc tttaacttta gcatctggta tgttgaactt agtggcattg atgatcctga 480
tatagtacaa ccttgtctca actggtatag caagtaccgt gaacaggaag ctattcgcct 540
ttgcctaaaa cacttcagac aacacaacta tacagaagct tttgagtcac tgcaaaagaa 600
aaccaagatt gcactggaac atcccatgtt aacagatatt catgacaagc tggtgttgaa 660
gggtgatttt gatgcttgcg aagagttgat tgaaaaggct gtaaatgatg gcttgttcaa 720
tcagtatatc agtcaacagg aatataagcc acgatggagt caaatcattc ccaaaagtac 780
caaaggtgat ggggaagata accgtccagg aatgagagga ggccatcaga tggttattga 840
tgttcaaaca gagactgttt atttgtttgg tggctgggat ggaacacaag atcttgctga 900
cttctgggcg tacagtgtga aggagaacca gtggacatgt atctctagag acactgaaaa 960
agagaatggt cctagtgcca gatcgtgtca taaaatgtgc attgatattc aacggaggca 1020
aatctacaca ttggggcgtt acttggattc ctctgtgagg aacagcaaat ctctgaaaag 1080
tgacttctat cgttatgaca ttgatacaaa cacatggatg ttactaagtg aggatactgc 1140
tgctgatgga gggccgaaat tggtgtttga tcatcagatg tgtatggact cagaaaaaca 1200
tatgatctac acttttggtg gtagaatttt gacttgtaat ggcagcgtag atgacagcag 1260
agccagtgaa ccacaattca gtggcttgtt tgctttcaac tgtcaatgtc aaacctggaa 1320
acttcttcga gaggactcct gtaatgctgg tjcctgaggac atccagtctc gaataggaca 1380
ctgcatgtta ttccactcaa aaaatcgttg cttatatgta tttggtggcc agcgatcaaa 1440
gacctatttg aatgatttct ttagttatga tgtggactct gatcatgtag acataatatc 1500
agatggcacc aagaaagact ctgggatggt tccaatgaca ggatttacac agagagcaac 1560
tattgatcca gaactgaatg aaatacacgt cttatctgga ctcagcaaag ataaggaaaa 1620
gagggaagaa aatgttagaa attcattctg gatttatgac attgtgagga atagttggtc 1680
ttgtgtctat aagaatgatc aagctgcaaa ggataatcca actaaaagtc ttcaggaaga 1740
agaaccatgt ccaaggtttg cccatcagct tgtatacgat gagctacaca aggttcatta 1800
cttatttggt gggaatccag gaaaatcttg ctctccaaag atgagattag atgacttctg 1860
gtcactgaag ttgtgtagac cttcaaaaga ttatttactg aggcattgca agtacctcat 1920
aagaaaacac aggtttgaag aaaaggccca agtggatccc cttagtgctc tgaaatattt 1980
acaaaatgat ctttatataa ctgtggatca ttcagaccca gaagagacaa aagagtttca 2040
gctcctggca tcagctctat tcaaatctgg ttcagatttt acagctctgg gcttttctga 2100
tgtggatcac acctatgctc aaagaactca gctctttgac accttagtaa atttctttcc 2160
tgacagcatg actcctccta aaggcaacct ggtagacctc atcacactgt aactgaagag 2220
tcactggaca cagaaatgga aaacaggagt cgattttccg tcttttggat tgcagctcca 2280
ctgactgaca gtaaagctgc agtgattgag gactgcacca gagttctgaa gggatcttaa 2340
ccatcacaag tttttaccct cttccttcat gcctgacctc aaccccgctc tcctcatcct 2400
attcctaaat taggctaata aagtgaaatt ggtatacttt ccagttaaat atatatatat 2460
atatattttt tcttacttta tcttttaaga attaatgagt ataaaagcaa aattaggcag 2520

<210> 30

<211> 1954

<212> DNA

<213> Homo sapiens

<220> 

<221> misc_feature

<223> Incyte ID No: 1598937CB1 



<400> 30 

ccgagttatg cagctccggg cggcaagggg tcctgtcgag gggcgcggtc cataagggtt 60
gaggccagag gcgcgtacct ctgggcgccg agctctggag tggaggctgg ggttcggagt 120
gcgtcatctg gaagcaggca ccccggccgg caggcagagt cacggtggca gcattgagag 180
ttggacaccc gggtccttga agtgatctct aggccccagc cccaaatccg ccaccattcc 240
gtgctgcggg gacaccatgg ctccagaaga ggacgctgga ggggaggcct tagggggcag 300
tttctgggag gctggcaact acaggcgcac ggtacagcgg gtggaggacg ggcaccggct 360
gtgcggggac ctggtcagct gcttccagga gcgcgcccgc atcgagaagg cttatgccca 420
gcagttggct gactgggccc gaaagtggag ggggaccgtg gagaagggcc cccagtatgg 480
cacactggag aaggcctggc atgccttttt cacggcggct gagcggctga gcgcgctgca 540
cctggaggtg cgggagaagc tgcaagggca ggacagtgag cgggtgcgcg cctggcagcg 600
gggggctttc caccggcctg tgctgggcgg cttccgcgag agccgggcgg ccgaggacgg 660
cttccgcaag gcccagaagc cctggctgaa gaggctgaag gaggttgagg cttccaagaa 720
aagctaccac gcagcccgga aggatgagaa gaccgcccag acgagggaga gccacgcaaa 780
ggcagacagc gccgtctccc [illegible] gcgcaaactg caggaacggg tggaacgctg 840
tgccaaggag gccgagaaga caaaagctca gtatgagcag acgctggcag agctgcatcg 900
ctacactcca cgctacatgg aggacatgga acaggccttt gagacctgcc aggccgccga 960
gcgccagcgg cttcttttct tcaaggatat gctgctcacc ttacaccagc acctggacct 1020
ttccagcagt gagaagttcc atgaactcca ccgtgacttg caccagggca ttgaggcagc 1080
cagtgacgaa gaggatctgc gctggtggcg cagcacccac gggccaggca tggccatgaa 1140
«S?™^ aS ttc 9 a 99 a 9 t 9gtccttgga cacacagagg acaatcagcc ggaaagagaa 1200
raring 9 agccct 9 at 9 aggttaccct gaccagcatt gtgcctacaa glgatggcac 1260 
t«aoaa»t ^r" 9 ' 0 " Cggggtcccc »fl9«cgflgg caggatgagg ag?ggtcaga IbIS 
taacttaall S S ^tgccaccgg ggttcgggtg agggcactct atgactacgc 1380 
gctgatgagc tgagcttccg agcaggggag gagctgctga agatgagtga 1440 

ITccltTJ: SEKSfSi ? CCaaggC ? a ^tgcagagt ggScgSattg g?ctgtac2c "oS 

^^f aa 5 gtggagtgtg tgggcgcctg agtgtcctga cagcccttct gcaacgttta 1560 

r"*?^ tgg "cagagccc agcttctcct ggagagccgg accctcaggg LctgLccg 1620 

aSaaccttJa gC * gctcctc tgtcccttga gggaggaagt cctgggal?? agggaggggl 1680 

tnnntnntr 9 tcta f^^g ggactggtag ggaagggacg agtctaggct gagggcaaga' 1740 

c?SE??~? gagg ^ a "9 aagcgttcag gggtgcctgg gcctccccag Iagl?gtgga 1800 

ctcagttcct gacctctgct ttggggttcc tggggtgggc ttggggtgag tgtagttdtg 1860 

IcacafaXc ^ cctctt ^ ggcttgtt ^ agcgtg?f?t aaSStJS a?acScccJ? 
acacaaaaac aaaaacacca aaaaaaaaaa aaaa 1954



<210> 31 

<211> 1817 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1725801CB1 

<400> 31 



catoalctac llllltll^ cacggc 9 ccc Q^gggctggg gcggtcgctt cttccttctc 60 
cgtggcctac gagggtcccc agcctgggta aagatggccc catggccccc gaaoaoccta 120 
gtcccagctg tgctctgggg cctcagcctc ttcctcaacc tccclgga?? tatc?SScS 180 
cagccctctc cacctcccca gtcttctccc ccgcctcagc cccatffgtg tcatalltgc 240 
StSE^? " gacagctt taacaagggc ctggagagaa ccatccgSgf caactttgga 300 
ggtggaaaca ctgcctggga ggaagagaat ttgtccaaat acaaagacag tgagacccac 360 
ctggtagagg tgctggaggg tgtgtgcagc aagtcagact tcgagEgcS lltlltllll 420 
gagctgagtg aggagctggt ggagagctgg tggtttcaca agLgcagga tlocllltll 480 

ccctcctolc S?c^St tC agattCCCtg aa 9ctctgct gSccSgcgS S"S?S5S £S 

ttccctgtcc tgggggaaca gagaggccct. gcggtggcta cgggcagtgt 600 

gaaggagaag ggacacgagg gggcagcggg cactgtgact gccaagccgg ctacdggqgt 660 

gaggcctgtg gccagtgtgg ccttggctac tttgaggcag LcgcIacSc caJcS?I?g 720 

tSSESS 111T " tgg CCCCtgtgcc cgatgcfcag gadtgSE atcfaactgl ill 
ttgcaatgca agaagggctg ggccctgcat cacctcaagt gtgtagacat tgatgagtat 840 
ggcacagagg gagccaactg tggagctgac caattctgcg Eglaclctga gScfc?tSt 900 
SSiSSSS ocSS 3 ~«=tBcct» ggctgcaJg? gggcaggg^c fgtcgc^t III 
aagaagtgta gccctggcta tcagcaggtg ggctccaagt gtctcgatgt ggatgagtgt 1020 
facat 3 ?^ tgtgtccggg agagaacaag cagtgtgaaa acaccgaggg Sggttltcgb lolo 
o£?cao«S octX caa 9 ca ^tg gaaggcatct gtgtgaa^I gclgatccSa 1140 
gag ^ ag ^ a ? gCt 5 Ct 5 C 5 C agagatgaca gaagacgagt tggtggtgct gcagcagatg 1200 
accgccf?ct tcatta^S r tg " Ctggcc acgctggctg ctaaggglga SttggtgetS 1266 
f^f^tn^ tcattggggc tgtggcggcc atgactggct actggttgtc agagcgcagt 1320 
Ktcccacfc SS" catcaa 9^^ agataatcgc ggccaccacc tgtfggacct Jsso 
ttaatttltt t??Z»!£% cagagctt 99 gctgccctcc tgctggacac tcaggacagc 144 0 
ttggtttatt tttgagagtg gggtaagcac ccctacctgc cttacagagc agcccaoota 1500 
cccaggcccg ggcagacaag gcccctgggg taaaaagtag ccctgafgg? gSSSStS lieo 
Saata^oa ^ gg ^ gggga ^ggcaggct tcacaafgtg tgaafttcaa al^tt^c? lll°o 
£ ctgctagagc tttggcccct gcttaggatt aggtggtcct cacaggggtg 1680 
StccSwS ccc 'tt^ tg ^ a ?^ 9 , C atgct S cca 9 ttcctgttct gtgttSScZ 1740 
aagt^ata aaaaaaf ° aCttatttat 'catctcagg aaataaagaa aggtcttgga 1800 



1817 



<210> 32 

<211> 2694 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 1730482CB1 

<400> 32 
gacctagtgt gagcataatg gaaaaaacac aatcacttcc tacacgacca ccaacttttc 60
ctccaaccat tccaccagca aaagaagtat gtaaggcggc caaggctgac ctggtattta 120
tggtggatgg atcctggagc attggagatg aaaatttcaa taagatcatc agctttctat 180
acagcactgt tggagccctg aacaagattg gcacagatgg aacccaagtt gcaatggttc 240
agttcactga tgatcccaga acagaattta aactaaatgc ttacaaaacc aaagagactc 300
ttcttgatgc aattaaacac atttcataca aaggaggaaa tacaaaaaca ggaaaagcaa 360
ttaagtatgt tcgagatacc ttgttcactg cagagtcagg tacaagaagg ggcatcccaa 420
aggttatcgt ggttataact gatggaagat cacaagatga tgtgaacaaa atctccaggg 480
agatgcaatt agatggctat agcatttttg caattggtgt ggccgatgca gattactcgg 540
agttggttag cattggcagt aagcccagcg cacgccatgt cttctttgtg gatgactttg 600
acgcctttaa gaaaatcgaa gatgagttaa ttacttttgt ctgcgaaaca gcatcagcaa 660
cctgtccagt ggtacacaag gatggcattg atcttgcagg atttaagatg atg^aaatgt 720
ttggtttggt tgaaaaagat ttttcatcag tggaaggggt ttctatggag cctggtacct 780
tcaatgtgtt tccatgttac caactccata aagatgccct ggtttcccag ccaaccaggt 840
acttgcaccc agaaggattg ccctccgact acacaatcag ttttctattc cggattcttc 900
ctgacactcc acaggagcca tttgctcttt gggagatttt aaataaaaat tctgacccat 960
tggttggggt tattttagac aatggtggga aaactctaac atatttcaac tatgaccaga 1020
gtggggattt tcaaactgtt actttcgaag gacctgaaat taggaaaatt ttttatggaa 1080
gctttcacaa gctacacatt gttgtcagtg agactttggt caaagtggtt attgactgca 1140
agcaagtggg tgagaaggca atgaacgcat cagctaatat cacgtcagat ggtgtagaag 1200
tgctagggaa aatggttcga tcaagaggac caggtggaaa ctctgcaccg ttccagttac 1260
agatgtttga tattgtttgc tccacatcat gggccaatac agacaaatgc tgtgaacttc 1320
caggcctgag agatgatgag tcttgcccag accttcccca ttcctgctcc tgttctgaaa 1380
ccaatgaagt ggctctggga ccagcgggcc caccaggtgg tccaggactc cgaggaccaa 1440
agggccagca aggtgaaccg ggtccaaagg gaccagatgg ccctcggggt gaaattggtc 1500
tgccaggacc tcagggtcca cctggacctc aaggaccaag tggtctgtcc attcaaggaa 1560
tgcccggaat gccaggagaa aaaggagaga aaggagatac tggccttcca ggtccacagg 1620
gtatcccagg aggcgttggt tcaccaggac gtgatggctc accaggccag aggggccttc 1680
cgggaaagga tggatcctcg ggacctccag gaccaccagg gccaataggc attcctggca 1740
cccctggagt cccagggatc acaggaagca tgggaccgca aggcgccctg ggaccacctg 1800
gtgtccctgg agcaaagggg gaacgaggag agcggggtga cctgcagtct caagccatgg 1860
tgagatcagt ggcgcgtcaa gtatgcgaac agctcatcca gagtcacatg gccaggtaca 1920
ctgccatcct caaccagatt cccagccact cctcatccat ccggactgtc caagggcctc 1980
ctggggagcc tgggaggcca ggctcacctg gagcccctgg tgaacaagga cccccaggca 2040
caccaggctt ccccggaaat gcaggcgtgc cagggacccc aggagaacga ggtctaactg 2100
gtatcaaagg agaaaaagga aatccaggcg ttggaaccca aggtccaaga ggcccccctg 2160
gaccagcagg accttcaggg gagagtcggc ctggcagccc tgggccccct ggctctcctg 2220
gaccaagagg cccaccaggt catctggggg ttcctggacc ccaaggtcct tctggccagc 2280
ctggatattg tgacccctca tcatgttctg cctatggtgt gagagctccc catccagatc 2340
agccagagtt cacccctgtc caagatgagc tggaagccat ggaactgtgg ggccctggag 2400
tctgatagcc tcaggagaaa tttgaagacc aactgcaaga actcttaagg aatcttgttt 2460
gagaaaatgt tgttatgtgg tttgtatgct acttttgggg ggcagggctc atttcagcag 2520
cctaaatctc ctccttggat aatgttaata ttattattat tattaacaaa aaatatatat 2580
ttttaaaaag ttcccttaat ctatgacatg gtagcaatga tttccctttg gtgtcttaat 2640
ggcatgtcag ataatttgtt tttccagaga agagagctca aagaggaatt ggga 2694

<210> 33

<211> 1149

<212> DNA

<213> Homo sapiens

<220> 

<221> misc_feature

<223> Incyte ID No: 1810058CB1 

<400> 33

cagtatctgg gtccagcctg cagccttagg gtccaggtga tgttaccgtg tgtgtggccc 60 

ttcttcacag tggcctccta gaaaaacaag accctgactc aaagaacacc tctcactaca 120

ttcagagtct gtcatctgaa ccatgaggat ctggtggctt ctgcttgcca ttgaaatctg 180 

cacagggaac ataaactcac aggacacctg caggcaaggg caccctggaa tccctgggaa 240 

ccccggtcac aatggtctgc ctggaagaga tggacgagac ggagcgaagg gtgacaaagg 300 

cgatgcagga gaaccaggac gtcctggcag cccggggaag gatgggacga gtggagagaa 360

gggagaacga ggagcagatg gaaaagttga agcaaaaggc atcaaaggtg atcaaggctc 420 

aagaggatcc ccaggaaaac atggccccaa ggggcttgca gggcccatgg gagagaaagg 480 

cctccgagga gagactgggc ctcaggggca gaaggggaat aagggtgacg tgggtcccac 540

tggtcctgag gggccaaggg gcaacattgg gcctttgggc ccaactggtt taccgggccc 600 

catgggccct attggaaagc ctggtcccaa gggagaagct ggacccacgg ggccccaggg 660 



WO 00/68380 PCT/US00/12811

gtccggggaa taagaggctg gaaaggagat cgaggagaga aagggaaaat 72 0 
S?t2c?tcJ ^2i^ gC Caa t aagtgC tttcact 9tg gggclclcgg tg??Sagcaa 780 ' 
aaatott^ nf^rl 9C " attaaatt tgataagatc cacatcactg tHttctccag 84 0 
gtgtctttgg tcaaaaacgg agtaaaaata ctgcacacca gagatgctta 900 
gaggaccagg cctctggcag cattgtcctg cagctgaagc Ecggggatga 960 
gatgtggctg caggtgacag gaggagagag gttcaatggc ttgtttgctg atglSjacga 1020 
tgacacaact ttcacagggt tccttctgtt cagcagccag tglcagaggl gaSttatE 10sS 
JtStt^a 3 CCatCCatCa fl«tcagctt gggatgaac? tSttcSgatg gttSSSS lllo 

1149 

<210> 34 

<211> 1215 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 2040679CB1

<400> 34 

SSSScE ggaaca ^5 «SSSS a aaaa SSc ta SSSS?E"SS5SS 3 24 00 ° 

SSfSfiJ aaaaagcctt aggaaattag acttgagtca caaccatita aaaaagc?tc 360 
S?ca a t a ao a 9 ?L 9 ^ ^ ^acaccttc aagaacttaa cctgaatgac aatcacttgg 420 
agtcatttag tgtagccttg tgtcattcta cactccagaa gtcacttcgq aotttooaec 480 
att?aa a a a f ?" Mt «"? ffcactccctg tgcagttttg fcagctccfg gfactff ajf 540 

acctt aaa tj SaS ^ aa " ga ^ C aatttcctt 9 caagataggf caactaatfa 600 

acctccgcct tttgtcagca gctcgaaata agcttccatt tttocctaat rrsat- t-r»rr« «n 

caataataaa actacaa" 9 ^aatacttt tgalca^a laagtccftc £S 

cagtaataaa gctgcaagca ccattaactt tattggaatc ttctacacaa arrai- a t-^r -7«n 

ISSSSSS ^"S^ ? I = 
KSSK SSUSSS ssssk'sjssu? ~ I 0 

assss: sssss ss:.^ &sss ii 

???SaL C Ka »IL" Ct " t=c»„.ctt t Ltt.«a? IISS!?St JJJS 



1215 



<210> 35 

<211> 1300 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 2960051CB1 

<400> 35 



ttctgcacaa agaactgaaa ggcatttatc cccaagggag gcagttattt taaattttae fin 
taagaagttc agcaaatact tttcaacatt cccttctgtc ?tt?ct?tct ttttaaaSS iIS 
agctctgatt ttgtttcatt ttcagctgga gacttaaltg acacSaagfa aagccKS? Ho 
SSSXf^ ^fK"" 33333 atcaa ^catgaaga? KSgt^g Ho 
llllllttll £™S«£i»J catttttcaa actgactttg gaaaaaatga agaaattcct 300 
aggaagcaaa ggaggaagat ctaccacaga aggttgagga aaagttcaac ctcacacaaa 3 60 
cacagatcaa acagacagct tggaattccg caaacaacag tttttacacc agtagcaa^S 420 
cttcctattg ttaactttga ttatagcatg gaggaaaagl ttgaatcctt ?tc2S???? \lo 
cctggagtag aatcaagcta taatgtgtta ccaggaaaga agggacactg tttSofaaaS ill 
ggcataacca tgtacaacaa agctgtgtgg tcgcctgagc c??|?a"a? cte???c?SI 60S 
tcagatggaa gagttctttg tgatgaaacc atgtgccatc ccclgaggtg Iccccaaaca 660 
gttatacctg aaggggaatg ctgccctgtc tgctccgcta ctggLJagl gttltlglll "S 

a ??aa a at a a llltllf^t " Mtc J"« -cttccattt gtttttgtt. S?££StSS ill 
attaaaatta tgatacatta gtcagatctg agtacttaaa atattggcaa aatgctgatt 840 
aacatagaaa atatctggga aaatgtatgg taggggatat aaatal?aga ctgtgg??" 9oS 
atagttctag ctctatcaga ttcagtaaac ttggatgaga ttacattcca catttgactc 960
tcagctttag agatatggta acagaatttc tacaacagat cctgaattct tattgcatta 1020 
aaaactctgc tttggtctat atgtgcatta tcccacttaa tccagtgcaa cgtgccttta 1080 
tcalcttgla gccagggtaa acaaaggaag agtgatttgc atctaaagag aacaaagccc 1140 
caaccctctg gctataccca accactcaaa ggcagcacag gaacccacat cactgcttgg 1200 
ataatcccag gaaaatgcag aaaaagtgta gcctgaagca tgattttctc atgtggcact 1260 
tctgtgtgca ggagatcaca gcgcggtttt gttgctgcca 1300



<210> 36 

<211> 1562 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3117318CB1 

<400> 36 



aaggccgggc gcggtaagag cgtctcgggg agtagggcaa ggcggccggg cccctcccat 60
Iccgcclllt cttcagcgtc ctgcccgcgg cactggctgc gggtgccggg ccacctgcga 120 
■atotacqcag ggactctgga cacccgcggc ggcgagctga gggagcagtc tccacgagga 180 
cccaggcgga ccctctggdg ccatgcgcgc cctccccggc ctgctggagg ccagggcgcg 24 0 
tacgclccgg ctgctcc^cc tccagtgcct tctcgctgcc gcgcgcccaa gctcggcgga 300 
cggcagtgc? ccagattcgg cttttacaag tccacctctc agagaagaaa taatggcaaa 360 
t^ c tft?cc ?tggagag?c ataacatatc actgactgaa cattctagta tgccagtaga 420 
aaaaaatatc actttagaaa ggccttctaa tgtaaatctc acatgccagt tcacaacatc 480 
tgggga£ttg ta^gcagtaa Ifgtgacttg gaaaaaagat ggtgaacaac "gagaataa 540 
t?a?lttgtc agtgcaacag gaagcacctt gtatacccaa tacaggttca ccatcattaa 600 
tagcaaacaa atgggaagtt attcttgttt ctttcgagag gaaaaggaac aaaggggaac 660 
atttaatttc aalgtccctg aacttcatgg gaaaaacaag ccattgatct cttacgtagg 720 
^" Uctact qtcttgacat gtaaatgtca aaattgtttt cctttaaatt ggacctggta 780 
^Ktagtaat gggag?gtaa aggttcltgt tggtgttcaa atgaataaat atgtgatcaa 840 
tggaacatat gctaacgaaa caaagctgaa gataacacaa cttttggagg aagatgggga 900 
atcttactgg tgccgtgcac tattccaatt aggcgagagt gaagaacaca ttgagcttgt 960 
ggtgctgagc tatttggtgc ccctcaaacc atttcttgta atagtggctg aggtgattct 1020 
Utagtggcc accattctgc tttgtgaaaa gtacacacaa aagaaaaaga *9cactcaga 1080 
tgaggggaaa gaatttgagc agattgaaca gctgaaatca gatgatagca atggtataga 1140 
aaataatqtc cccaggcata gaaaaaatga gtctctgggc cagtgaatac aaaacatcat 1200 
gtcgagaa^c attgglagat Itacagagtt cgtatttcag ctttgtttat "ttcctgtt 1260 
lagaglctct gag?ttttag ttttaaaagg atgaaaagct tatgcaacat gctcagcagg 132Q 
agcttcatca Icgatatatg tcagatctaa aggtatattt tcattctgta attatgttac 1380 
aLaaagcaa tgLaatcag aataaatatg ttagaccaga a acaaa a »"atattct 1440 
ggtcttcaaa ggacacacag aacagatatc agcagaatca cttaatactt catagaacaa 1500 
laatcactca laacctgttt ataaccaaag aattcatgaa aaagaaagcc tttggccatt 1560 



tg 

<210> 37 

<211> 2801 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 
<222> 2793 

<223> a, t, c, g, or other 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3486992CB1



oactttgctt gaatgtttac attttctgct cgctgtccta catatcacaa tatagtgttc 60

Lgttttgtt aaaactttgg ggtgtcagga gttgagcttg ctcagcaagc "S^ggct 120 

aaaataaqct ttgttatagc agcttgccaa ttggtgctgg gcctactaat gacttcatta 180 

accaagt?tt ccatacagaa tagtgagtgt ccacaacttt gcgtatgtga aattcgtccc 240 

tgg?t?accc cacagtcaac ttacagagaa gccaccactg ttgattgcaa tgacctccgc 300 

tKacaagga ttcccagtaa cctctctagt gacacacaag tgcttctctt acagagcaat 360 
aacatcgcga agactgtgga tgagctgcag cagcttttca acttgactga actagatttc 420
tcccaaaaca actttactaa cattaaggag gtcgggctgg caaacctaac ccagctcaca 480
acgctgcatt tggaggaaaa tcagattacc gagatgactg attactgtct acaagacctc 540 
agcaaccttc aagaactcta catcaaccac aaccaaatta gcactatttc tgctcatgct 600 
tttgcaggct taaaaaatct attaaggctc cacctgaact ccaacaaatt gaaagttatt 660 
gatagtcgct ggtttgattc tacacccaac ctggaaattc tcatgatcgg agaaaaccct 720
gtgattggaa ttctggatat gaacttcaaa cccctcgcaa atttgagaag cttagttttg 780 
gcaggaatgt atctcactga tattcctgga aatgctttgg tgggtctgga tagccttgag 840 
agcctgtctt tttatgataa caaactggtt aaagtccctc aacttgccct goaaaaagtt 900 
ccaaatttga aattcttaga cctcaacaaa aaccccattc acaaaatcca agaaggggac 960 
ttcaaaaata tgcttcggtt aaaagaactg ggaatcaaca atatgggcga gctcgtttct 1020 
gtcgaccgct atgccctgga taacttgcct gaactcacaa agctggaagc caccaataac 1080 
cctaaactct cttacatcca ccgcttggct ttccgaagtg tccctgctct ggaaagcttg 1140 
atgctgaaca acaatgcctt gaatgccatt taccaaaaga cagtcgaatc cctgcccaat 1200 
ctgcgtgaga tcagtatcca tagcaatccc ctcaggtgtg actgtgtgat ccactggatt 1260 
aactccaaca aaaccaacat ccgcttcatg gagcccctgt ccatgttctg tgccatgccg 1320 
cccgaatata aagggcacca ggtgaaggaa gttttaatcc aggattcgag tgaacagtgc 1380
ctcccaatga tatctcacga cagcttccca aatcgtttaa acgtggatat cggcacgacg 1440
gttttcctag actgtcgagc catggctgag ccagaacctg aaatttactg ggtcactccc 1500 
attggaaata agataactgt ggaaaccctt tcagataaat acaagctaag tagcgaaggt 1560 
accttggaaa tatctaacat acaaattgaa gactcaggaa gatacacatg tgttgcccag 1620 
aatgtccaag gggcagacac tcgggtggca acaattaagg ttaacgggac ccttctggat 1680 
ggtacccagg tgctaaaaat atacgtcaag cagacagaat cccattccat cttagtgtcc 1740 
tggaaagtta attccaatgt catgacgtca aacttaaaat ggtcgtctgc caccatgaag 1800
attgataacc ctcacataac atatactgcc agggtcccag tcgatgtcca tgaatacaac 1860 
ctaacgcatc tgcagccttc cacagattat gaagtgtgtc tcacagtgtc caatattcat 1920 
cagcagactc aaaagtcatg cgtaaatgtc acaaccaaaa atgccgcctt cgcagtggac 1980 
atctctgatc aagaaaccag tacagccctt gctgcagtaa tggggtctat gtttgccgtc 2040
attagccttg cgtccattgc tgtgtacttt gccaaaagat ttaagagaaa aaactaccac 2100 
cactcattaa aaaagtatat gcaaaaaacc tcttcaatcc cactaaatga gctgtaccca 2160 
ccactcatta acctctggga aggtgacagc gagaaagaca aagatggttc tgcagacacc 2220 
aagccaaccc aggtcgacac atccagaagc tattacatgt ggtaactcag aggatatttt 2280
gcttctggta gtaaggagca caaagacgtt tttgctttat tctgcaaaag tgaacaagtt 2340 
gaagactttt gtatttttga ctttgctagt ttgtggcaga gtggagagga cgggtggata 2400
tttcaaattt ttttagtata gcgtatcgca agggtttgac acggctgcca gcgactctag 2460 
gcttccagtc tgtgtftggt ttttattctt atcattatta tgattgttat tatattatta 2520
ttttatttta gttgttgtgc taaactcaat aatgctgttc taactacagt gctcaataaa 2580 
atgattaatg acaggatggg gttcccctgt gcttttacca gtagcatgac cccttctgaa 2640 
gccatccgta gaaagtactt tgtccccaaa aagcaacata cggtttgaac agcatgaaac 2700
tttgtagcat cgggctaaga ctttaactca gagcaaggca gactggtacc tcgttaagat 2760 
gtagtgactg cggatgttta cactgaatga agntgcttaa t 2801 

<210> 38 

<211> 2597 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 4568384CB1 

<400> 38 

ccagaggcaa agaggccagt gaggactgct ctgtgcagtt gtccaggcct gaagaaggtg 60 
gcggtgattt gaatcagaag aggtggcatt ctcttcatta ggaggatatg gataatcaag 120" 
acctaacaaa ttgggctccg aaagatacct tcgtcatggg gaagagtgaa tggagagaag 180 
ataatataaa atgccaaacg tatataaatt gaaaggggaa aattgagaga acttgcgatc 24 0 
catggtcttg gtcttccata aaggggaact gggacatcca ctggagcaga gcacagattg 300 
gcccaagagc cccaagactc ccactggcct ccgcagaggc cgacagtgta ttcgtcctgc 360 
ggagattgtg gcttccctgt tagaaggaga ggagaacacc tgtggcaaac agaaaccaaa 420 
ggaaaacaat ttaaagccaa aatttcaggc tttcaaggga gtaggctgtc tatatgaaaa 480 
ggagtcaatg aaaaaatcct tgaaagacag tgttgcctct aacaataaag atcagaattc 540 
catgaaacat gaggatccca gtatcatatc catggaagat gggtccccat atgttaatgg 600 
ctcattaggt gaagtgactc catgccaaca tgcaaagaag gcgaatggcc caaactatat 660 
tcagcctcaa aaaagacaga ccacttttga aagccaggat cgcaaggcag tgtcccctag 720 
cagttctgaa aagagaagta agaatcctat ttctaggcca ttagaaggta agaagtcctt 780 
aagtcttagt gcaaagactc acaacatagg ctttgacaaa gacagctgcc atagtaccac 840 
aaagacagaa gcttcacagg aagagcggtc tgattcaagc ggcctcacat ctctcaagaa 900 
atcaccaaag gtctcatcca aggacaotcg ggaaatcaaa actgatttct cactttctat 960
tagtaattcg tcagatgtga gtgctaaaga taagcatgct gaagacaatg agaagcgttt 1020
ggcagccttg gaagcgaggc aaaaagcaaa agaagtgcag aagaagctgg tgcataatgc 1080
Ectggcaaat ttggatggtc atccagagga taagccaacg cacatcatct tcggttctga 1140 
cagtgaatgt gaaacagagg agacatcgac tcaggagcag agccatccag gagaggaatg 1200 
ggtgaaagag tctatgggta aaacatcagg gaagctgttt gatagcagtg atgatgacga 1260 
atctgattct gaagatgaca gtaataggtt caaaattaaa cctcagtttg agggcagagc 1320 
tggacagaag ctcatggatt tacagtcgca ctttggcacc gatgacagat tccgcatgga 1380 
ctctcgattt ctagaaactg acagtgaaga ggaacaggaa gaggtaaatg aaaagaaaac 1440
tgctgaggaa gaagagcttg ctgaagaaaa aaagaaagcc ctgaatgttg tacaaagtgt 1500 
tttgcaaatc aacttaagca attctacaaa cagaggatca gtagctgcta agaaatttaa 1560 
ggacatcata cattatgatc caacgaagca agaccatgcc acttacgaaa gaaaaagaga 1620 
tgataaacca aaagaaagta aagcaaaacg aaaaaagaaa agggaggaag ctgagaaact 1680 
acctgaggtg tctaaagaaa tgtattataa tattgctatg gatctgaaag aaatattcca 1740 
aactacaaaa tataccagtg aaaaggaaga gggcacaccc tggaatgagg actgtggtaa 1800 
agagaaacct gaggaaatcc aggaccctgc agctctgacc agtgacgctg agcagcccag 1860 
cgggttcacg ttctcttttt ttgattcaga cactaaagac ataaaggaag agacctacag 1920 
agttgaaaca gtgaaacctg gaaagattgt ctggcaggaa gaccctcgtt tacaagacag 1980 
cagttcagaa gaggaagatg ttactgaaga aacagatcac agaaactcca gtcctggaga 2040 
agcatcatta cttgagaaag agaccactag atttttcttt ttctctaaga atgatgaacg 2100 
acttcaaggt tctgacttat tctggagagg agtaggaagt aatatgagca ggaactcttg 2160 
ggaggccaga acaaccaacc tgcgtatgga ttgtcgaaag aaacataaag acgcaaaaag 2220 
gaaaatgaaa ccaaaataat aaatgtcagc tggttttgat actgaatgtg aacaaggctc 2280 
acctaaggaa actgacccag aaaacagttt tagctgacaa agaagaaatt tcagagtgaa 2340 
ggaatttlaa aaatctggct gacggaatat cattctggtt gccatctttt tctgtggaac 2400
tcctctgcat ttcttcctaa gtaattactt caaaaattaa attcaacttc ttataaagga 2460
agaacaagat agtccttgaa aatacttttt gtatataatc tctttgccct ctatcctgag 2520
taactaatgg acatcttctc atgcaaggtt tatatgaagc ctttttaaat aaatgagtca 2580
aagcaaaaaa aaaaaaa 2597

<210> 39 
<211> 2641 

<212> DNA
<213> Homo sapiens

<220>

<221> misc_feature
<223> Incyte ID No: 4586187CB1 

<400> 39

ctgggaagaa agctatcagc accaactcag aactctccac cttcagatca gacattctag 60 
atctccgtca gcaacttcgt gagattacag aaaaaaccag caagaacaag gatacgctgg 120 
agaagttaca ggcgagcggg gatgctctgg tggacaggca gagtcaattg aaagaaactt 180 
tggagaataa ctctttcctc atcaccactg taaacaaaac cctccaggcg tataatggct 240 
atgtcacgaa tctgcagcaa gataccagcg tgctccaggg caatctgcag aaccaaatgt 300 
attctcataa tgtggtcatc atgaacctca acaacctgaa cctgacccag gtgcagcaga 360 
ggaacctcat cacgaatctg cagcggtctg tggatgacac aagccaggct atccagcgaa 420 
icaagaacga ctttcaaaat ctgcagcagg tttttcttca agccaagaag gacacggatt 480 
ggctgaagga gaaagtgcag agcttgcaga cgctggctgc caacaactct gcgttggcca 540
aagccaacaa cgacaccctg gaggatatga acagccagct caactcattc acaggtcaga 600 
tggagaacat caccactatc tctcaagcca acgagcagaa cctgaaagac ctgcaggact 660 
tacacaaaga tgcagagaat agaacagcca tcaagttcaa ccaactggag gaacgcttcc 720 
agctctttga gacggatatt gtgaacatca ttagcaatat cagttacaca gcccaccacc 780 
tgcggacgct gaccagcaat ctaaatgaag tcaggaccac ttgcacagat acccttacca 840 
aacacacaga tgatctgacc tccttgaata ataccctggc caacatccgt ttggattctg 900
tttctctcag gatgcaacaa gatttgatga ggtcgaggtt agacactgaa gtagccaact 960 
tatcagtgat tatggaagaa atgaagctag tagactccaa gcatggtcag ctcatcaaga 1020 
attttacaat actacaaggt ccaccgggcc ccaggggtcc aagaggtgac agaggatccc 1080
agggaccccc tggcccaact ggcaacaagg gacagaaagg agagaagggg gagcctggac 1140 
cacctggccc tgcgggtgag agaggcccaa ttggaccagc tggtcccccc ggagagcgtg 1200 
gcggcaaagg atctaaaggc tcccagggcc ccaaaggctc ccgtggttcc cctgggaagc 1260 
ccggccctca gggccccagt ggggacccag gccccccggg cccaccaggc aaagagggac 1320 
tccccggccc tcagggccct cctggcttcc agggacttca gggcaccgtt ggggagcctg 1380 
gggtgcctgg acctcgggga ctgccaggct tgcctggggt accaggcatg ccaggcccca 1440 
agggcccccc cggccctcct ggcccatcag gagcggtggt gcccctggcc ctgcagaatg 1500 
agccaacccc ggcaccggag gacaatagct gcccgcctca ctggaagaac ttcacagaca 1560 
aatgctacta tttttcagtt gagaaagaaa tttttgagga tgcaaagctt ttctgtgaag 1620 
acaagtcttc acatcttgtt ttcataaaca ctagagagga acagcaatgg ataaaaaaac 1680
agatggtagg gagagagagc cactggatcg gcctcacaga ctcagagcgt gaaaatgaat 1740
ggaagtggct ggatgggaca tctccagact acaaaaattg gaaagctgga cagccggata 1800
actggggtca tggccatggg ccaggagaag actgtgctgg gttgatttat gctgggcagt 1860
ggaacgattt ccaatgtgaa gacgtcaata acttcatttg cgaaaaagac agggagacag 1920
tactgtcatc tgcattataa cggactgtga tgggatcaca tgagcaaatt ttcagctctc 1980
aaaggcaaag gacactcctt tctaattgca tcaccttctc atcagattga aaaaaaaaaa 2040
gcactgaaaa ccaattactg aaaaaaaatt gacagctagt gttttttacc atccgtcatt 2100
acccaaagac ttgggaacta aaatgttccc cagggtgata tgctgatttt cattgtgcac 2160
atggactgaa tcacatagat tctcctccgt cagtaaccgt gcgattatac aaattatgtc 2220
ttccaaagta tggaacactc caatcagaaa aaggttatca ttggtcgttg agttatggga 2280
agaacttaag catatactgt gtaaacagtg ccatacattt ctaaaatccc aagtgtagga 2340
aaaatatgca gacatacaga tatataggcc aactattagt aataatatga aatatactta 2400
aagagctttt aaaactttgt atttttgtac aaaatatttg tcttttacaa tttttttcct 2460
tttttttttt ttgtcatttt accgacataa tacatggagc caaagaaaac aataatggta 2520
ctaataaaaa ctcctagggt ttcctgtcag atttaattct acccagtggc aaagaatttt 2580
ttcaattgtg gctttaaaaa aataattaaa tatacatgta tatatatata aaaaaaaaaa 2640
2641



<210> 40 

<211> 914 

<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 401801CB1



<400> 40 

cagaggtgta ctctacatga tgcacaaatg tgatatgtct ttttggtgaa gagcacgtga 60
tgaaaacagt aaataactga gaatagcaca aagctactag ggactcagat gattcaaata 120
ttgaagacta tgagtaatat taatcagcaa cttagtttgt tatcttcagt tatatgggag 180
gagtacatta ttctttgtta taggactacc tcacccttaa attgtaagtt ctttattagc 240
ttattgcatt ttccattctt aaagcagtag tttagtgttc tcttactgta tgacaattaa 300
aagtatactt aattgactac ataatgtgat agttaaaaaa tatatttaaa gtagcttttt 360
gaaagctttg ctgtttcccc ccctttttgt atataaaagc attagttgtc attattcatg 420
tgttctcatt actttttaca aatgagaaca atgccttatg catgttgtga acaattacta 480
aaattttatg taaatttaac ctttattttt aattaatatg tgttaagata taacattatt 540
ttatctatta atatatgtat tttaatttac aggaaaacat ctttatgtat ggaggcagaa 600
ttgaaacaaa tgatggcaat gtcacagatg aattatgggt ttttaacata catagtcagt 660
catggagtac aaaaactcct actgttcttg gacatggtca gcagtatgct gtggagggac 720
attcagcaca tattatggag ttggatagta gagatgttgt catgatcata atatttggat 780
attctgcaat atatggttat acaagcagca tacaggaata ccatatctgt gagttactta 840
aaaattgtaa tttctttatt gattgggaat gtttttctct ttaataaaat cttcatatga 900
atttaaaaaa aaaa 914



<210> 41 

<211> 1006 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1721842CB1 



<400> 41 

ggcaaagaga actacaaatc ccagcgtcac ccgcggcctt gaagccccgc ccctgacaaa 60
ctgaaggtcc cggtaagcat cgcgtcagta cttatggcgc ctgccgggtt gtggtgacga 120
aagcagttgc catggagttg ctctgagtaa ccctgaggca gtgggacgcc aagactggag 180
aggaagcgac tgcggggagt atttccattt taaccggaaa caatccctga acccacagga 240
atgaatgcct aatggtggag tttcagccat cagtgacagg ctgaacccag actcccaggg 300
cacctgcttg cacctttgaa tgatggcctg aactatgaac aaacgggact atatgaacac 360
ttcggtacag gagccccctc ttgactactc cttcagaagc atccacgtca ttcaagatct 420
ggtaaatgag gagccaagga caggactacg accactgaag cgttcaaagt cggggaaatc 480
actgacccag tccctgtggc tgaataacaa tgttctcaat gatctgagag acttcaacca 540
ggtggcttca cagctgttgg agcacccaga gaacctggcc tggatcgacc tgtcctttaa 600
tgacctgact tccattgacc ctgtcctaac aactttcttc aacctgagtg tcctctatct 660
tcacggcaac agcatccagc gcctggggga ggtgaataag ctggctgtcc ttcctcggct 720
ccgtagcctg acactccatg ggaaccccat ggaggaagag aaagggtata ggcaatatgt 780
gcHgtgcacc ctgtcccgta tcaccacgtt cgacttcagt ggggtcacca aagcagaccg 840
caccacagct gaagtctgga aacgcatgaa catcaagccc aagaaggcct ggaccaagca 900
gaatacactt tgaggctccc acgaccctag tagtcctaaa ggcctaagca tagacagcat 960
ggtttgacaa taaataattt gagctgttga gcagaaaaaa aaaaaa



<210> 42 

<211> 2582 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1833221CB1 



gttaaattta gtactgaatg cagccttttg ttgtttaaaa aatttttttg aactcacagt 60 
ctacatcagc atcagcatct gcgtcaccat ttcaatctgc atggtatagt gaatctghga 120 
taactcaggg agcacgctca agatcgcaga accagcaacg ggatcatgat tcaaaaagac 180 
ctaaactttc ctgtacaaac tgtactacct cagctgggag aaatgttgga aatggtttaa 240 
acacattatc agattcatct tggaggcata gtcaagttcc tagatcttca tcaatggtac 300 
ttggatcatt tggaacagac ttaatgagag agaggagaga tttggagaga agaacagatt 360 
cctctattag taatcttatg gattatagtc accgaagtgg tgatttcaca acttcatcat 420 
atgttcaaga cagagttcct tcatattcac aaggagcaag accaaaagaa aactcaatga 480 
gclctttaca gUgLtaca tcatccacaa accaccaatt gccttctgaa catcagacca 540
?Ictaagttc Lgggactcc agaaattctt taagatcaaa tttttcttca ^gagaatcag 600 
aatcttcccg aagcaatacg cagcctggat tttcttacag ttcaagtaga gatgaagccc 660 
caatcataag caattcagaa agggttgttt catctcaaag accatttcaa Q^cttctg 720 
acaatgaagg taggcggaca acgaggagat tgctgtcacg catagcttct *gcatgtcat 780 
ctact?tttt ttcacgaaga tctagtcagg attccttgaa tacaagatca "gaattctg 840 
aaaattctta cgtttctcca agaatcttga cagcttcaca gtcccgtagt aatgtaccat 900 
cagcttctga agttcccgat aatagggcat ctgaagcttc tcagggattt cgatttctta 960
ggcgaaga?g gggtttgtca tctcttagcc acaatcatag ctctgagtca gattcagaaa 1020 
a?t?taacca agaatctgaa ggtagaaata caggaccatg gttatcttcc tcactUgaa 1080 
atagatgcac acctttgttc tctagaagga ggcgagaggg aagagatgaa tcttcaagga 1140
taccta3ctc tgatacatca tctagatctc atatttttag aagagaatca aat^aagtgg 1200 
ttcaccttga agcacagaat gatcctcttg gagctgctgc caacagacca "agcatctg 1260 
cagcatcaag cagtgccaca acaggtggct ctacatcaga ttcggctcaa ggtggaagaa 1320 
aticaggaa? atcagggatt cttcctggtt ccttattccg gtttgcagtc qccccagcac 1380 
ttgggagtaa tttgaccgac aatgtcatga tcacagtaga tattattcct tcaggttgga 1440 
attlagctga tggtaaaagt gataaaacta aaagtgcgcc ttcaagagat ccagaaagat 1500 
tgcagaaaat aaaagagagc ctccttttag aggactcaga agaagaagaa ggtgacttat 1560 
SagLtttg tcaaltggca gctgcatcat catctaattt gctgatagag ccatgcaagt 1620 
gcacaggaag tttgcagtat gtccaccaag actgtatgaa aaagtggtta caggccaaaa 1680 
?taac?ctgg ttcf tcatta gaagctgtaa ccacctgtga actatgtaaa ^gaagttgg 1740 
agcttaacct ggaggatttt gatattcatg aactacatag agctcatgca aatgaacaag 1800
c?gagtatga g?t?atcagc Ltggtctct acctagtggt gttattgcac "gtgcgaac 1860 
aaagcttttc tgatatgatg ggaaatacaa atgaaccaag cacacgtgtc cgatttatta 1920 
accugcaag aactcttcag gcacatatgg aagatctcga aacttcagag gatgattccg 1980 
aagaagacgg agaccataac aggacatttg atattgccta acttcatata a 9 a «9*^g "JJ 
atgatctgtg aacataagtg tttattaaaa atggcaatta aatataaatt acttttgtgg 2100 
gggaatglc? aataaataca ttgactatat ataaaatgaa tatatacata cacatgtatg 2160 
cctgtatata tatattcatt ctccagtgtt gctgaattaa aattctgctg gactttttaa 2220 
catagcaaat ccgatgttta taaactggta atcaaaaagg ttttttcttt taggtgagtg 2280 
ggaaagtatt acccttgttt taaatatcta agcaatgcct atcaaccctt ttttgtgtta 2340 
gattlctgt agtcatlttt atgaaaaaag gtttgtgttt tactcttgct agtgagaaaa 2400 
gfgggacaaa atatactttt gaaataaaat gctatatggc acctaattat "tttctttt 2460 
lalalgcctt aagttgcagt ctcattttga taatcatttg cttccagtgt "aaaaatta 2520 
aaaaaagaat ggggagaagg ttatgagaag agcattatta agtttccaaa tttaatttga 2580 
at 

<210> 43 

<211> 2849 

<212> DNA 

<213> Homo sapiens 
<220>

<221> misc_feature

<223> Incyte ID No: 2041168CB1

<400> 43
acgtgaccaa gagctgacgt gtgcagaagt ccttcttgtc ctggtcgttg ttcccgtctg 60
agtaccagct ccccactgcc ctgagggcgg gccggcctgc ggcggaggga aaaaggaaga 120
ggagaaggaa attgtcccga atccctgcag tctttctgta ggttgcggca caacgccagg 180
caaaagaaga ggaaggaatt taatcctaat cggtggaggt cgatttgagg gtctgctgta 240
gcaggtggct ccgcttgaag cgagggagga agtttcctcc gatcagtaga gattggaaag 300
attgttggga gtggcacacc actagggaaa agaagaaggg gcgaactgct tgtcttgagg 360
aggtcaaccc ccagaatcag ctcttgtggc cttgaagtgg ctgaagacga tcaccctcca 420
caggcttgag cccagtccca cagccttcct cccccagcct gagtgactac tctattcctt 480
ggtccctgct attgtcgggg acgattgcat gggctacgcc aggaaagtag gctgggtgac 540
cgcagcctgg tgattggggc tggcgcctgc tattgcattt atagactgac taggggaaga 600
aaacagaaca aggaaaaaat ggctgagggt ggatctgggg atgtggatga tgctggggac 660
tgttctgggg ccaggtataa tgactggtct gatgatgatg atgacagcaa tgagagcaag 720
agtatagtat ggtacccacc ttgggctcgg attgggactg aagctggaac cagagctagg 780
gccagggcaa gggccagggc tacccgggca cgtcgggctg tccagaaacg ggcttccccc 840
aattcagatg ataccgtttt gtcccctcaa gagctacaaa aggttctttg cttggttgag 900
atgtctgaaa agccttatat tcttgaagca gctttaattg ctctgggtaa caatgctgtt 960
tatgcattta acagagatat tattcgtgat ctgggtggtc tcccaattgt cgcaaagatt 1020
ctcaatactc gggatcccat agttaaggaa aaggctttaa ttgtcctgaa taacttgagt 1080
gtgaatgctg aaaatcagcg caggcttaaa gtatacatga atcaagtgtg tgatgacaca 1140
atcacttctc gcttgaactc atctgtgcag cttgctggac tgagattgct tacaaatatg 1200
actgttacta atgagtatca gcacatgctt gctaattcca tttctgactt ttttcgttta 1260
ttttcagcgg gaaatgaaga aaccaaactt caggttctga aactcctttt gaatttggct 1320
gaaaatccag ccatgactag ggaactgctc agggcccaag taccatcttc actgggctcc 1380
ctctttaata agaaggagaa caaagaagtt attcttaaac ttctggtcat atttgagaac 1440
ataaatgata atttcaaatg ggaagaaaat gaacctactc agaatcaatt cggtgaaggt 1500
tcactttttt tctttttaaa agaatttcaa gtgtgtgctg ataaggttct gggaatagaa 1560
agtcaccatg attttttggt gaaagtaaaa gttggaaaat tcatggccaa acttgctgaa 1620
catatgttcc caaagagcca ggaataacac cttgattttg taatttagaa gcaacacaca 1680
ttgtaaacta ttcattttct ccaccttgtt tatatggtaa aggaatcctt tcagctgcca 1740
gttttgaata atgaatatca tattgtatca tcaatgctga tatttaactg agttg^tctt 1800
taggtttaag atggataaat gaatatcact acttgttctg aaaacatgtt tgttgctttt 1860
tatctcgctg cctagattga aatattttgc tatttcttct gcataagtga cagtgaacca 1920
attcatcatg agtaagctcc cttctgtcat tttcattgat ttaatttgtg tatcatcaat 1980
aaaattgtat gttaatgqtg gaaagaaaaa aagaagaaag aaaaaaacca tccctgtcct 2040
tcagtttata atctagttgg agagataaga aacgtacaaa ccaaaagata acagaatatc 2100
tgaagcatgt actcattgtc agatgttccc tctgagagca cagaggaggc aaaagcttct 2160
gtgggatgtg ctagtcggct aaagcttcac agaggaggtg gcaattgaaa atgagtcctg 2220
aatggggtag ggtggttagg gaattccatg agacaagaca aggggggcat ggtgtgagaa 2280
aggcatggaa gtaggaaccc tcttcctatg acaggagatc attctgctta gagtggagag 2340
tgtggagagt gggagtagat aattttggaa agctgggtga agccagttgt ggagaattgt 2400
ttgaatatta tcccattgaa tacccagagc cactaaatct ttttttacta gaaaataatt 2460
ggggtccata tgaaagtctc tattactgag tagtgtcaat gagggtgtgg caaaatggag 2520
cctttcacat cctagtggtg gccatttggt aatacagata taagccttaa actatgtaaa 2580
cccttgtcct aaggaagtaa ttgaataatt gcccaaagat tgtatgtatg aggctgttca 2640
tcccagcact gtctaagcta gtaaaaattg gaaacaattt aagtatctag cacattggat 2700
tggttataaa gcaaggaatg ttcacacagt aggatattat aagtatgctg atggaaatct 2760
atattgccag gaaaagctat tcattatgcg ttgtgaagtc agaaagtaaa aaagggtaga 2820
tagaagtatt cgaagtatag ttccatttt 2849



<210> 44 

<211> 670 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 2365794CB1

<400> 44 

gaggcaagaa ttcggcacga ggggcgccgc gggcatttct tccactgccc gtctgaggga 60 
acgctaagta gtgtgtccgg cgccgtgttc cagctccgcg ttgttccgcg agaaagcgag 120 
aggccgagcc cgggctggtg cgatggccgc ggtggtggcc aagcgggaag ggccgccgtt 180 
catcagcgag gcggccgtgc ggggcaacgc cgccgtcctg gattattgcc ggacctcggt 240
gtcagcgctg tcgggggcca cggccggcat cctcggcctc accggcctct acggcttcat 300
cttctacctg ctcgcctccg tcctgctctc cctgctcctc attctcaagg cgggaaggag 360
gtggaacaaa tatttcaaat cacggagacc tctctttaca ggaggcctca tcgggggcct 420
cttcacctac gtcctgttct ggacgttcct ctacggcatg gtgcacgtct actgaaatgg 480
gggcccgggg gactttttta aaaaaccaga tcgggaggac tgtggccagc aattaacacc 540
atgtagactt ccttagttct taagtggttg aattcgctgc ttgttctgta acgttataaa 600
taatttatat ctgaagacgg agagcctgta atattcttca gattaaatga agcgtgagac 660
aaaaaaaaaa



<210> 45 

<211> 2364 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2618452CB1 



ctcaatgcca acaggcacca ttcctccacc gacaacgctg aaggccacag ggtccaccca 60 
cacagcccca ccaatgatgc caaccaccag tgggaccagc caagcctcaa gctccttcaa 120 
cacagccaaa acctctacat ccctacattc acacacttcc tccacacacc atcctgaagt 180 
caccccaact tctatcacca acatcaccct caaccccacc agtataggaa cctggacacc 240 
cgtggcccac accacctcgg ccaccagcag caggctaacc acacccttca ccacacattc 300 
cccacctaca gggagcagtc ccatctcttc cacaggtcct atgactgcaa catcctttca 360 
gaccLcact faftltacac caccatcaca ccctcagacc acacttccca ctcacgttcc 420 
Icctttctcc acctccttgg tgactccaag tactcacaca gtcatcatca ctacccacac 480 
acagatggcc acttctgcct ccatccactc aacgccaaca ggcaccgttc ctccaccaac 540 
aacgctcaag gccacagggt ccacccacac agccccacca atgacagtga ccaccagtgg 600 
gaccagccal acccacagct cattcagcac agctacagcc tcttcttcct tcatatcctc 660 
ctcgtcttgg tcgtcttggc tgcctcagaa ctctagctca aggccaccgt catcacctat 720 
caccacacaa ctcccccact tgagttctgc aaccactcct gtttccacaa ctaatcagpt 780 
gtcctcctca ttttctccca gtccttctgc cccctctact gtttcttctt atgtgocctc 840 
ctcccactcc tctccccaga cttcatcgcc ttctgttggc acatcttcct ^tttcgtgtc 900 
cgcccccgtg cactccacaa ccctgagctc ggggtcacac tcctcattgt ccactcatcc 960 
clcgactgcl tcagtgtctg catctcctct ttttccttct tctccagctg cctotactac 1020
catfagggcc actctccccc acactatctc ctctcctttc accctctctg ctctactccc 1080 
catatcclct gttacggtgt ctcccacccc atccagccac ctagcctcca gcaccat gc 1140
atttccgtcc acgcccagga ccacggccag cacccacacc gcccctgcct tctcct^ctca 1200 
gtccaccacc tcgcggtcca cttctctcac cacccgagtt cccacatcag gctttgtgtc 1260 
Ictcacctcg ggggtiacgg gtatccccac ctctccagtc accaacctta ccaccaggca 1320 
ccctggtccc Iccttgtcgc ctaccacacg gttcctgacc agctccctca ctgcccatgg 1380 
aagcacccct gcttctgccc cggtatcttc tctcgggaca cctacgccca cctcacccgg 1440 
ggtctgcagt gtgcgggagc agcaggagga gatcacgttc aaggggtgca tggcgaacgt 1500 
gacggtaacc cgctgtgagg gcgcctgcat ttccgctgcc agcttcaaca tcatcaccca 1560
gcaggtggat gcccgctgca gctgctgccg ccccctccac tcctatgagc ^gcagctgga 1620 
gctgccctgc cccgatccca gcacgcctgg ccggcggctc gtactcaccc tgcaggtgtt 1680 
cagccactgc gtgtgcagct ctgtggcctg tggagactag cagggtcgct gcctgctctc 1740 
ctggggctga aggactgcag atgacagaca ggaaaacacc caccagcccc cttcccgctt 1800
gtgcclgclg c?gctt?cc? ggtcaccagg cctggccccc aagtgccctg ggccgtggct 1860 
ccctggggca ccggttggag aggggctgcc aagcaggggc tcagactacc acactcctgc 1920 
agacc3?gag ccagcagaga gggactgagg cggacagtgg tcacggacct cccaggcaca 1980 
clgggcactc ccgaccaccc ctgcccactg tccaacacct cccagcccct ^acttggcc 2040 
ccagccctgc tgggcccaga accctgcaga tgaagccaca gagcaggcgc tcgaccagac 2100 
ccatcagggg cgaggagggc acggaaacct gtgccgagat gggggcaaga ggcccaggca 2160 
gcc"acclg1a clglgalgag gagatcccca gagtcaggga gggcagaggg tggcagcgag 2220 
ggcagggcag ccgcccccgc tcccagccag gcagaaggcc cccaccagca ccacacccat 2280 
ccccagcagc ctgtccttgg gagagggcgt cacccggtca gagactccaa ataaaccggt 2340
tcttgtcaag gcacaaaaaa aaaa 



<210> 46 

<211> 3600 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> misc_feature

<223> Incyte ID No: 2622288CB1

<400> 46 

ac??ac?o?o ^? CtCCt cctc f c 9 ctc ctcctcggcc tccccttcgg gcgctctcgc 60 
nnit^-lf f ctcctccggg gccctccgcc tgctcccagt catggtggcc tggcgctcgg 120 
t?aataa t ?J ^ 9CC ^ CgCt "ctccttgg ccaccctggt ccagcglgga tllgggga?? 180 
taac ctggag gatgcagtga aagaaacttc ctcagtaaag cagccatggg 240 
caaataStno »£ C " CC ?? a ac ^ aata 9^ caggaaccac cagagctccg gcSaaac??? 300 
^9^99 attggacttg gctgatgctt tggatgatca agatgatggc cgcaggaaac 3 60 
^??^ agg fggaagagag agatggaacc atgtaaccac cacgaccaag aggc?2gtaa 420 
^ a ^ agagc tccagcaaat actttaggaa atgattttga cttggctga? gcSctgJatg 480 
aclaaaatct JS^S?* ggccgca9 ? a aaccaattgc tggaggagga ggtttttcag 540 
ZrmZnnJZ tgaagacata gtagggggtg gagaatacaa acctgacaag ggtaaaggtg 600 
c ggcagcaat gacgaccctg gatctggcat ggtggcagag cctggciSca 660 
ttgccggggt ggccagcgcc ctggccatgg ccctcatcgg tgccgtctcc agcticatct 720 
»an^nl 9Ca gaagaa9ttc tgcttcagca ttcagcaggg tfctcLcgca glcta^Jtga 780 
agggagagaa cctggaagcc gtggtatgtg aggaacccca agtgaaatac tccaegttgc 840 
a ^ g " gtc tgcagagccg ccgccgccgc ccgaaccagc ccggatctga gggccctgtc 900 
ZtfZtr " cat ? cacaat ggtgccaccg cttgtcaccc ggctccccSc ISccttSat 960 
ttggacccgc agctgbtgtg ctgctctgtg ccatcggctc cttgttggtc tgagtttccc 1020 
??^ 9CtC tgggtgtttg tgagtttggt ttctctgccc . tgccccSgc g?g?tgaglc J SI S 
Ji-^S 98 aa " caa 9 a 9 ccagctctga tagaaagcca gcaccagcct cgggagcfgc 1140 
cfalaataat nnl^^f cca9Cct9cc tccagcttta ctgagcacag gffgcggggg ^200 
? a a 9 a gctgaggcct gatgacattt atgcttaggg gacaagagtt tgaacEcIIg 1260 
??S?iIc!at ^'? CaCaC tOTffltMct cattgtggca ggtttctgcc aatagacagc 1320 
^^ gaCagt 99 cc tcaagg agctgcaggt ggggggctca gcctgcaccc acttggagcc 1380 
atoatt 3 ?? 3 ?? 9aa ? Cggt ca 9 caccaa 5 taacaccaca cacacgcagc accclggltj 144 0 
ttcagtcttc cccatcccag gttttatgtt gctgggcttc cggagagccl 1500 
gtccaagcgg aggctttcag tgatttaagt acaaacatgc atctcgtgat agtcttgcct 1560 

at a cc!taoa ^anT^ gg3taa ? tat BtaggSctgt ttJcStcS 1*20 

a 5^" atagg SSSCtSggtg tggccttcgg gttgacatga gaaaggtctt tagcaatcat 1680 
; c C g""? gagatgagtt ttatcctgtg ttggggagag gtgctSeccc tcLclctgt 1740 
acttat?^ tggtagcaag agtgaccgat gtcaagaacg agcatcaaag ccagaatcft 1800 
acttSatt?a atT^T,^ ^g^c ggcgggggag gagaggggaa agagacattc 1860 
?«»™ gtgaaacgca ggtgactttg tagctctgtg gtcagcctac ttgtc'tgctc 1920 

tgagggagag tgcgtgggga gccatgctca ccgtggcaaa cacaggaacc ccltgaltcg 1980 
fr^5f? CC 9gcgtgga 9 c tgcctggttt gggctggagc agagctggtt tcctggaat? 2040 
lllllll* 9 * ccacatat 99 ttctgtcccg gtgagctctg ttgtcagagg ctcalggga? 2100 
agaa ^ acat g^tagggtct agggcccctg tctactgata gtcagtttgc tgtgtSgaa 2160 
llaltlalll aaa 9"5 ata tgagtcacca gacaggcagg atcttacaaa actfacgggc 2220 
ctctttggtc tgcatgatgg ccccatgcgt ttcataggct gtccactgag cgggattgtc 2280 
acatacltc? Ei?' 96 "" ^ cca ^ c ttaaggaaac cactggaltl tgllgcclcl 2340 
^"^"5^ g tctaac 9 ca tgcctcgtgt tcgttttgca aacatgcctg tggtggaggg 2400 
rr?^ I 9 ta ? ccct S t ff cgtctcaagg ctgccttgtg aggccattcc clgtglgtgl 2460 
'"?^, tc cttaccaccc cttttcctgc tcggcccttt aatccctgac agacftggac 2520 
tgtgtggctg aagggggacc tgcagcactg cagaaatgcc tctgcgtggt gccatgaagg 2580 
antnn^ Ct tQ3Cc ^ tc tcgagaagct tcccatgctt caggaSgtta gSagJgtgg 2640 
ggtggcttgc aggattggcc tgtttccagg gcctcccaca ctcattggcc agat™g2 2700 
ctttgtcagg cttgtccctc cctgatacca agtatgtcga gaaccgatgg ccccalcltc 2760 
tggctggtgc tgggccggag gtggctatgg aggattttgg catgcgtggc ctgtcgccac 2820 
tlllat^f tgaCCtcagg Wttgtccac tttaccttta tggtg!gg?c tg?cggatgS 28sS 
ctaagtcctt gaaaccctag agctgtgacg tagaatatgt gctgtctgtg agaccgtgtt 2940 
cccaggagca ctgactgcag ttgagagaga cccattttgc tctlccttaS ciccclcJgc 3000 
ttctgcacaa agcctagagc ctggcactca agcccaccgg tggcagctcc 3 060 
Itnrr?^" a ^ at g cct gg aa 9 acccctc agccttctgt ttgcagaacg ttcatttcag 3120 
gagcttctcc ttcccacaga catcttacac ttgctcgaca ctgccacctg cagaagcctg 3180 
gcgggctctg gtcaccatgt gtctatctga aggttgcact ggccagcatg ggcctgtccc 3240 
aag f gagagg ggagacacag tggactgaaa ggactggttg aaagtggcca atctctatca 3300 
g " gagaaaa tttgtaacaa ctctgagcac atgctgggtg aagtcacagc 3360 
ao 33 ?^ 333 ? ataaa9ct9 9 gcggaaggag gtgtgcgtgg cttctggggt gggacccaga 3420 
ggggaggCtC tgggacaggg gctggggttc agtgccaggg ccctgaggaa gaaatgggga 3480 
ctgatctcaa aattccagaa ttccctgtac atctgttcac gtgcttgtgt ccaggtStga 3540 
cttgtaaact gtctagtgtt tgcattaaat aaaatggcac cgagcagala aaalLIafa 3600 

<210> 47 
<211> 1236 
<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature
<223> Incyte ID No: 2806595CB1

<400> 47

ttaatttccc cgaaatcaga ctgctgcctt ggaccgggac agctcgcggc ccccgagagc 60
tctagccgtc gaggagctgc ctggggacgt ttgccctggg gccccagcct ggcccgggtc 120
accctggcat gaggagatgg gcctgttgct cctggtcccg ttgctcctgc tgcccggctc 180
ctacggactg cccttctaca acggcttcta ctactccaac agcgccaacg accagaacct 240
aggcaacggt catggcaaag acctccttaa tggagtgaag ctggtggtgg agacacccga 300
ggagaccctg ttcacctacc aaggggccag tgtgatcctg ccctgccgct accgctacga 360
gccggccctg gtctccccgc ggcgtgtgcg tgtcaaatgg tggaagctgt cggagaacgg 420
ggccccagag aaggacgtgc tggtggccat cgggctgagg caccgctcct ttggggacta 480
ccaaggccgc gtgcacctgc ggcaggacaa agagcatgac gtctcgctgg agatccagga 540
tctgcggctg gaggactatg ggcgttaccg ctgtgaggtc attgacgggc tggaggatga 600
aagcggtctg gtggagctgg agctgcgggg tgagatgcta acggggactg ggtgadactg 660
ggacctgaga gcagagggga gaggaccaga gaaaacatcc agacctctgt gctttagaca 720
tttaaaagta cttaattctc aaaacaaccc acatggaagc tactgttgtg acccccattt 780
tacaggtgag aaaactgagg cacagagagg tcaagtaact tacctaaggt cacacagctt 840
gtaaacgaca gagctggggt ctgaacccaa gcacccagcc tctagaatct gttcccctct 900
acccgctgta attcacatct cattcagaga gaggaaaacc agagctggtc ccacagctta 960
ttagagacag agctgagatt taagcaaggt tagcgggtaa cacaagcgaa tgaggcagcc 1020
cactgtgaga cgatgggagt ggtggggact gggcacactc ctgagccctt gtcctgtgcc 1080
cagggagctc ccacactact caggcccagc tgatgatgcc acgccagaat gcagacccca 1140
ctgccaaatt ttctggcttg acaagataag cttatatttt tatgtgagat ctccagattt 1200
ttatgtaaaa acctcctttt taaaaacaaa acaaaa 1236

<210> 48

<211> 3081

<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 2850987CB1

<400> 48

gcccggccca cggcggcggc ggcggcggcg gagagagctg gctcagggcg tccgctaggc 60
tcggacgacc tgctgagcct cccaaaccgc ttccataagg ctttgccttt ccaacttcag 120
ctacagtgtt agctaagttt ggaaagaagg aaaaaagaaa atccctgggc cccttttctt 180
ttgttctttg ccaaagtcgt cgttgtagtc tttttgccca aggctgttgt gtttttagag 240
gtgctatctc cagttccttg cactcctgtt aacaagcacc tcagcgagag cagcagcagc 300
gatagcagcc gcagaagagc cagcggggtc gcctagtgtc atgaccaggg cgggagatca 360
caaccgccag agaggatgct gtggatcctt ggccgactac ctgacctctg caaaattcct 420
tctctacctt ggtcattctc tctctacttg gggagatcgg atgtggcact ttgcggtgtc 480
tgtgtttctg gtagagctct atggaaacag cctccttttg acagcagtct acgggctggt 540
ggtggcaggg tctgttctgg tcctgggagc catcatcggt gactgggtgg acaagaatgc 600
tagacttaaa gtggcccaga cctcgctggt ggtacagaat gtttcagtca tcctgtgtgg 660
aatcatcctg atgatggttt tcttacataa acatgagctt ctgaccatgt accatggatg 720
ggttctcact tcctgctata tcctgatcat cactattgca aatattgcaa atttggccag 780
tactgctact gcaatcacaa tccaaaggga ttggattgtt gttgttgcag gagaagacag 840
aagcaaacta gcaaatatga atgccacaat acgaaggatt gaccagttaa ccaacatctt 900
agcccccatg gctgttggcc agattatgac atttggctcc ccagtcatcg gctgtggctt 960
tatttcggga tggaacttgg tatccatgtg cgtggagtac gtcctgctct ggaaggttta 1020
ccagaaaacc ccagctctag ctgtgaaagc tggtcttaaa gaagaggaaa ctgaattgaa 1080
acagctgaat ttacacaaag atactgagcc aaaacccctg gagggaactc atctaatggg 1140
tgtgaaagac tctaacatcc atgagcttga acatgagcaa gagcctactt gtgcctccca 1200
gatggctgag cccttccgta ccttccgaga tggatgggtc tcctactaca accagcctgt 1260
gtttctggct ggcatgggtc ttgctttcct ttatatgact gtcctgggct ttgactgcat 1320
caccacaggg tacgcctaca ctcagggact gagtggttcc atcctcagta ttttgatggg 1380
agcatcagct ataactggaa taatgggaac tgtagctttt acttggctac gtcgaaaatg 1440
tggtttggtt cggacaggtc tgatctcagg attggcacag ctttcctgtt tgatcttgtg 1500
tgtgatctct gtattcatgc ctggaagccc cctggacttg tccgtttctc cttttgaaga 1560
tatccgatca aggttcattc aaggagagtc aattacacct accaagatac ctgaaattac 1620
aactgaaata tacatgtcta atgggtctaa ttctgctaat attgtcccgg agacaagtcc 1680
tgaatctgtg cccataatct ctgtcagtct gctgtttgca ggcgtcattg ctgctagaat 1740 
cggtctttgg tcctttgatt taactgtgac acagttgctg caagaaaatg taattgaatc 1800 
tgaaagaggc attataaatg gtgtacagaa ctccatgaac tatcttcttg atcttctgca 1860 
tttcatcatg gtcatcctgg ctccaaatcc tgaagctttt ggcttgctcg tattgatttc 1920 
agtctccttt gtggcaatgg gccacattat gtatttccga tttgcccaaa atactctggg 1980 
aaacaagctc tttgcttgcg gtcctgatgc aaaagaagtt aggaaggaaa atcaagcaaa 2040 
tacatctgtt gtttgagaca gtttaactgt tgctatcctg ttactagatt atatagagca 2100 
catgtgctta ttttgtactg cagaattcca ataaatggct gggtgttttg ctctgttttt 2160
accacagctg tgccttgaga actaaaagct gtttaggaaa cctaagtcag cagaaattaa 2220 
ctgattaatt tcccttatgt tgaggcatgg aaaaaaaatt ggaaaagaaa aactcagttt 2280 
aaatacggag actataatga taacactgaa ttcccctatt tctcatgagt agatacaatc 2340
ttacgtaaaa gagtggttag tcacgtgaat tcagttatca tttgacagat tcttatctgt 2400 
actagaattc agatatgtca gttttctgca aaactcactc ttgttcaaga ctagctaatt 2460 
tatttttttg catcttagtt atttttaaaa acaaattctt caagtatgaa gactaaattt 2520 
tgataactaa tattatcctt attgatccta ttgatcttaa ggtatttaca tgtatgtgga 2580 
aaaacaaaac acttaactag aattctctaa taaggtttat ggtttagctt aaagagcacc 2640 
tttgtatttt tattatcaga tggggcaaca tattgtatga agcatatgta gcacttcaca 2700 
gcatggttat catgtaagct gcaggtagaa gcaaagctgt aaagtagatt tatcacacaa 2760 
tgactgcata cagacttcaa atatgtcaat agtttggtca tagaacctag aagccaaaag 2820 
ccacacagaa gggcaagaat cccaatttaa ctcatgttat catcattagt gatctgtgtt 2880 
gtagaacatg agggtgtaag ccttcagcct ggcaagttac atgtagaaag cccacacttg 2940 
tgaaggtttt gttttacaaa tcacttgatt taacacactc aggtagaata tttttatttt 3000 
tactgtttta tacccagaag ttatttctac attgttctac agcaagaata ttcataaagg 3060 
tggaccttgc aagtgcgtat a 3081

<210> 49 

<211> 1825 

<212> DNA 

<213> Homo sapiens 

<220>

<221> misc_feature

<223> Incyte ID No: 3557211CB1

<400> 49

cgtgttgaag gcatcagacc ctgacactga ggacgatcag ataatcttta aaattctaca 60 
aggcccaaaa catggacatc tggagaacac aacaacaggt gaatttatcc atgagaaatt 120 
tagccaaaag gacttaaaca gtaagactat tctttacatc ataaacccat ctttggaagt 180 
aaattcagat accgtggaat ttcaaatcat ggaccccaca gggaactcgg ccactcctca 240
aattttggaa ctgaagtggt ctcatattga atggtcacag accgaatata tctgtgagaa 300
tgtgggtttg ttgcccttgg aaattatcag aaggggatat tccatggact cggcctttgt 360
gggtataaag gtcaaccaag tgtcagctgc agttggaaaa gatttcaccg tgattccatc 420
taaactgatt cagtttgacc caggaatgtc aactaagatg tggaatatag caattaccta 480
tgacggatta gaggaagatg atgaggtctt tgaagtaatt ctgaactccc ctgtgaatgc 540
agttcttggc acaaagacaa aagctgcagt gaaaattttg gactcaaaag gaggacaatg 600 
ccatccttca tattcctcca accaaagcaa gcacagcaca tgggagaagg gcatttggca 660 
tctgctgccc ccagggtctt cctcatccac cacttctggt tcctttcatc tggaaagaag 720 
acctcttcca tcttccatgc agctagcagt catcagggga gacaccctgc ggggctttga 780 
ttctacagat ctttctcaaa ggaagcttag gacccgtggg aatggcaaaa cagttcgtcc 840
atcctctgtt tatagaaatg gaacagacat catctataat tatcatggga tagtttcctt 900 
gaaactggag gatgacagtt tcccaactca caaaaggaag gccaaagtat ccatcattag 960 
tcagccacaa aagacaatca aagtggcaga actgcctcaa gcagataagg tggaatccac 1020 
aactgactca cacttcccca gacaggacca gttgccctca tttccaaaga actgcactct 1080 
ggaattaaag ggactcttcc attttgaaga aggcatccag aagctgtatc agtgcaatgg 1140 
gatcgcctgg aaagcctgga gtccccaaac caaggatgtg gaagacaaat cctgtccagc 1200 
cgggtggcac cagcactcag gctactgtca catcttgatc acagagcaga aaggcacctg 1260 
gaatgcggct gcccaagctt gcagggaaca atacctgggc aaccttgtaa ctgtattctc 1320 
caggcagcac atgcggtggc tctgggacat tggtgggaga aagtcctttt ggataggttt 1380 
gaacgaccaa gtgcatgctg gccactggga gtggatcggt ggtgaacctg ttgccttcac 1440
caatgggaga agagggccct ctccacgctc caagcttgga aagagctgtg ttttggttca 1500 
aagacaaggg aaatggcaaa caaaagactg taggagagcc aaacctcata attatgtgtg 1560 
ttccagaaaa ctctaaatat aacagaccct acagggggcc acctggagtt tgtcacctat 1620 
ttattcacag gatctgtgaa tattgctcca tagaaaacaa attgttatga ttgagtgggt 1680 
atacctttgt gattctgtct agtgaaaatg ggacattttt aatagtgcca gaaagattga 1740 
taaataaata ttttttacaa gataagatac aatttttgta tctcaatacc ttttaaaata 1800 
aatgccagca gtattaaaaa aaaaa 1825
<210> 50
<211> 1712
<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 4675668CB1 

<400> 50 

ctttcttcag tccccacgtg cgatccttcc cggcaacttt ttcgagaaaa atgcccaaat 60
tcaaggcggc ccgtggggtg gggggtcagg aaaaacatgc gcccctggcc gatcagatcc 120
tggctgggaa tgcggtgcgg gcgggggtcc gggagaagcg gcggggtcgc gggacaggag 180
aagcggagga agagtatgtg gggccccggc tgagccgacg gattttgcag caagcacggc 240
agcaacagga ggaactcgag gccgagcatg ggactgggga caagcccgcg gcgccgcggg 300
aacgcaccac gcggctgggt ccaagaatgc ctcaggatgg atcagatgac gaggacgagg 360
agtggcccac cctggagaag gctgccacaa tgacagcagc gggccatcat gcagaggtgg 420
ttgtggaccc tgaggatgag cgtgccatag agatgttcat gaacaagaac cctcctgcca 480
ggcgcaccct ggctgacatc atcatggaga agctgactga gaagcagaca gaggttga^a 540
cagtcatgtc agaggtgtcg ggcttcccta tgccccagct ggacccccgg gtcctagaag 600
tgtacagggg ggtccgggag gtattatcta agtaccgcag tggaaaactg cccaaggcat 660
ttaagatcat ccctgcactc tccaactggg agcaaatcct ctacgtcaca gagccggagg 720
cctggactgc agctgccatg taccaggcca ccaggatttt tgcctctaac ctgaaggaac 780
gcatggccca gcgcttctac aaccttgtcc tgctccctcg agtacgagat gacgttgctg 840
aatacaaacg actcaacttc catctctaca tggctctcaa gaaggccctt ttcaaacctg 900
gagcctggtt caaagggatc ctgattccac tgtgcgagtc tggcacttgt accctccggg 960
aagccatcat tgtgggtagc atcatcacca agtgctccat ccctgtgttg cactccagtg 1020
cggccatgct gaaaattgct gagatggaat acagcggtgc caacagcatc ttcctgcgac 1080
tgctgctgga taagaagtat gcactgcctt accgggtgct ggatgcccta gtcttccact 1140
tcctggggtt ccggacagag aagcgtgaac tgcctgtgct gtggcaccag tgcctcctga 1200
ctttggtcca gcgctacaag gccgacttgg ccacagacca gaaagaggcc ctcttagaac 1260
tgctccggct gcagccccat ccacagctat cgcccgaaat caggcgtgag cttcagagtg 1320
cagtcccccg cgatgtggaa gatgttccca tcaccgtgga gtgaggaaaa cagtcagctg 1380
tcctggccaa aggggtttgg aaggacacca agacccccgt tggtgactga agatgacact 1440
gagctttaat ggctgaagac ccagatcagg gcagtgacag atcacaggga catctgtggc 1500
tcccagtcca ggacaggaag gactgagggt ctggctggtt ccctcttcca ttctaggccc 1560
ttatccctgt ttagttctga gagccaactt gagataccat atgctagcat tcccagtcpc 1620
cagctggggc ttggtgtgag tactttttct atggctattg tgtcaggtca ctgt^gataa 1680
aggcaaagac agatatttat tgaaaaaaaa aa 1712
Continuation of Box I.2
Claims Nos.: 18, 19, 21, 22 



Present claims 18, 19, 21 and 22, directed to agonists and antagonists 
relate to an extremely large number of possible compounds. Support within 
the meaning of Article 6 PCT and/or disclosure within the meaning of 
Article 5 PCT is not to be found, however, for any specific example of 
the compounds claimed. In the present case, the claims so lack support, 
and the application so lacks disclosure, that a meaningful search over 
the whole of the claimed scope is impossible. Consequently, no search has 
been carried out for claims 18, 19, 21 and 22. 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1-23 partially 

Polypeptide comprising SEQ ID NO:1, variants and fragments 
thereof, antibody binding to it; polynucleotide of SEQ ID 
NO:26, variants thereof, cell and transgenic organism 
comprising the same; probes derived from the polynucleotide 
and use thereof in a diagnostic method; pharmaceutical 
composition comprising the polypeptide and its therapeutic 
use; use of the polypeptide in screening assays for 
agonists, antagonists and compounds capable of altering the 
expression of the polynucleotide; therapeutic use of the 
agonists and antagonists. 

2. Claims: 1-23 partially 

Idem as subject 1 for SEQ ID NOs:2 and 27 

3. Claims: 1-23 partially 

Idem as subject 1 for SEQ ID NOs:3 and 28 

4. Claims: 1-23 partially 

Idem as subject 1 for SEQ ID NOs:4 and 29 

5. Claims: 1-23 partially 

Idem as subject 1 for SEQ ID NOs:5 and 30 

6. Claims: 1-23 partially 

Idem as subject 1 for SEQ ID NOs:6 and 31 

7. Claims: 1-23 partially 

Idem as subject 1 for SEQ ID NOs:7 and 32 

8. Claims: 1-23 partially 

Idem as subject 1 for SEQ ID NOs:8 and 33 

9. Claims: 1-23 partially 

Idem as subject 1 for SEQ ID NOs:9 and 34 
21. Claims: 1-23 partially 

Idem as subject 1 for SEQ ID NOs:21 and 46 

22. Claims: 1-23 partially 

Idem as subject 1 for SEQ ID NOs:22 and 47 

23. Claims: 1-23 partially 

Idem as subject 1 for SEQ ID NOs:23 and 48 

24. Claims: 1-23 partially 

Idem as subject 1 for SEQ ID NOs:24 and 49 

25. Claims: 1-23 partially 

Idem as subject 1 for SEQ ID NOs:25 and 50 